2016-02-29 193 views
-4

我必须在C#中编写这个代码#如何计算2个给定向量之间的Pearson相关性?

你可以在下面给出的例子中逐步解释吗?

vector 1 : [0.3, 0, 1.7, 2.2] 
vector 2 : [0, 3.3, 1.2, 0] 

泰非常

这将在文档聚类中使用

+1

HTTP://计算器。 com/questions/28428365/how-to-find-correlation-between-two-integer-arrays-in-java它是Java的实现,但很容易适应C# –

回答

4

这是对的Java版本

How to find correlation between two integer arrays in java

我的回答适应为C#。首先,皮尔逊相关是

http://en.wikipedia.org/wiki/Correlation_and_dependence

提供了这两个矢量(让他们IEnumerable<Double>)是相同长度的

private static double Correlation(IEnumerable<Double> xs, IEnumerable<Double> ys) { 
    // sums of x, y, x squared etc. 
    double sx = 0.0; 
    double sy = 0.0; 
    double sxx = 0.0; 
    double syy = 0.0; 
    double sxy = 0.0; 

    int n = 0; 

    using (var enX = xs.GetEnumerator()) { 
     using (var enY = ys.GetEnumerator()) { 
     while (enX.MoveNext() && enY.MoveNext()) { 
      double x = enX.Current; 
      double y = enY.Current; 

      n += 1; 
      sx += x; 
      sy += y; 
      sxx += x * x; 
      syy += y * y; 
      sxy += x * y; 
     } 
     } 
    } 

    // covariation 
    double cov = sxy/n - sx * sy/n/n; 
    // standard error of x 
    double sigmaX = Math.Sqrt(sxx/n - sx * sx/n/n); 
    // standard error of y 
    double sigmaY = Math.Sqrt(syy/n - sy * sy/n/n); 

    // correlation is just a normalized covariation 
    return cov/sigmaX/sigmaY; 
    } 

测试:

// -0.539354840012899 
    Double result = Correlation(
    new Double[] { 0.3, 0, 1.7, 2.2 }, 
    new Double[] { 0, 3.3, 1.2, 0 });