Series.corr(other, method=’pearson’, min_periods=None)
Compute correlation with other Series, excluding missing values.
>>> histogram_intersection = lambda a, b: np.minimum(a, b).sum().round(decimals=1) >>> s1 = pd.Series([.2, .0, .6, .2]) >>> s2 = pd.Series([.3, .6, .0, .1]) >>> s1.corr(s2, method=histogram_intersection) 0.3
DataFrame.corr(method=’pearson’, min_periods=1)
Compute pairwise correlation of columns, excluding NA/null values.
>>> histogram_intersection = lambda a, b: np.minimum(a, b).sum().round(decimals=1) >>> df = pd.DataFrame([(.2, .3), (.0, .6), (.6, .0), (.2, .1)], ... columns=['dogs', 'cats']) >>> df.corr(method=histogram_intersection) dogs cats dogs 1.0 0.3 cats 0.3 1.0
也可以指定DataFrame中特定columns做相關性比較
df[['column_1','column_2']].corr()