2015-03-31 96 views





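I think this is an interesting question, but it is too open-ended and opinion-based for SO. (It also isn't really about programming.) Perhaps it would be on-topic on Cross Validated? – Gregor 2015-03-31 21:53:08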


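Just to make sure we are talking about the same thing: you want to visualize probability density functions by looking at histograms of realizations of those probability distributions, right? Because cumulative distribution functions are something very different... – jhin 2015-03-31 22:34:59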


An example data set would be nice. – jhin 2015-03-31 22:35:09






library(ggplot2)
library(dplyr)  # provides bind_rows(), mutate() and %>%

## function that replicates default ggplot2 colors
## taken from [1]
gg_color_hue <- function(n) {
    hues <- seq(15, 375, length.out = n + 1)
    hcl(h = hues, l = 65, c = 100)[1:n]
}

## Set up sample data 
n <- 2000 
x1 <- rlnorm(n, 0, 1) 
x2 <- rlnorm(n, 0, 1.1) 
df <- bind_rows(data.frame(sample=1, x=x1), data.frame(sample=2, x=x2)) %>% 
    mutate(sample = as.factor(sample)) 

## Calculate density estimates 
g1 <- ggplot(df, aes(x=x, group=sample, colour=sample)) + 
    geom_density(data = df) + xlim(0, 10) 
gg1 <- ggplot_build(g1) 

## Use these estimates (available at the same x coordinates!) for 
## calculating the differences. 
## Inspired by [2] 
x <- gg1$data[[1]]$x[gg1$data[[1]]$group == 1] 
y1 <- gg1$data[[1]]$y[gg1$data[[1]]$group == 1] 
y2 <- gg1$data[[1]]$y[gg1$data[[1]]$group == 2] 
df2 <- data.frame(x = x, ymin = pmin(y1, y2), ymax = pmax(y1, y2), 
        side=(y1<y2), ydiff = y2-y1) 
g2 <- ggplot(df2) + 
    geom_ribbon(aes(x = x, ymin = ymin, ymax = ymax, fill = side, alpha = 0.5)) + 
    geom_line(aes(x = x, y = 5 * abs(ydiff), colour = side)) + 
    geom_area(aes(x = x, y = 5 * abs(ydiff), fill = side, alpha = 0.4)) 
g3 <- g2 + 
    geom_density(data = df, size = 1, aes(x = x, group = sample, colour = sample)) + 
    xlim(0, 10) + 
    guides(alpha = "none", colour = "none") +
    ylab("Curves: density\n Shaded area: 5 * difference of densities") + 
    scale_fill_manual(name = "samples", labels = 1:2, values = gg_color_hue(2)) + 
    scale_colour_manual(limits = list(1, 2, FALSE, TRUE), values = rep(gg_color_hue(2), 2)) 



Sources: [1] SO answer 1, [2] SO answer 2



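library(ggplot2)
library(dplyr)  # provides bind_rows(), mutate() and %>%
library(grid)   # provides grid.draw()

## function that replicates default ggplot2 colors
## taken from [1]
gg_color_hue <- function(n) {
    hues <- seq(15, 375, length.out = n + 1)
    hcl(h = hues, l = 65, c = 100)[1:n]
}
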
## Set up sample data 
n <- 2000 
x1 <- rlnorm(n, 0, 1) 
x2 <- rlnorm(n, 0, 1.1) 
df <- bind_rows(data.frame(sample=1, x=x1), data.frame(sample=2, x=x2)) %>% 
    mutate(sample = as.factor(sample)) 

## Calculate density estimates 
g1 <- ggplot(df, aes(x=x, group=sample, colour=sample)) + 
    geom_density(data = df) + xlim(0, 10) 
gg1 <- ggplot_build(g1) 

## Use these estimates (available at the same x coordinates!) for 
## calculating the differences. 
## Inspired by [2] 
x <- gg1$data[[1]]$x[gg1$data[[1]]$group == 1] 
y1 <- gg1$data[[1]]$y[gg1$data[[1]]$group == 1] 
y2 <- gg1$data[[1]]$y[gg1$data[[1]]$group == 2] 
df2 <- data.frame(x = x, ymin = pmin(y1, y2), ymax = pmax(y1, y2), 
        side=(y1<y2), ydiff = y2-y1) 
g2 <- ggplot(df2) + 
    geom_ribbon(aes(x = x, ymin = ymin, ymax = ymax, fill = side, alpha = 0.5)) + 
    geom_density(data = df, size = 1, aes(x = x, group = sample, colour = sample)) + 
    xlim(0, 10) + 
    guides(alpha = "none", fill = "none")
g3 <- ggplot(df2) + 
    geom_line(aes(x = x, y = abs(ydiff), colour = side)) + 
    geom_area(aes(x = x, y = abs(ydiff), fill = side, alpha = 0.4)) + 
    guides(alpha = "none", fill = "none")
## See [3] 
grid.draw(rbind(ggplotGrob(g2), ggplotGrob(g3), size="last")) 


...or with abs(ydiff) replaced by ydiff in the construction of the second plot, so that the lower panel shows the signed difference.
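A minimal sketch of the signed-difference variant, in case it helps to see it in isolation. It is not the code from the answer: instead of pulling the estimates out of ggplot_build(), it assumes base R's density() evaluated on a shared grid, so the bandwidths and the resulting curves may differ slightly from the plots above. The seed, the grid settings (from, to, n) and the names d1, d2, dens_df and p_signed are illustrative assumptions.

```r
library(ggplot2)

set.seed(1)  # assumption: seed added only so the sketch is reproducible
n <- 2000
x1 <- rlnorm(n, 0, 1)
x2 <- rlnorm(n, 0, 1.1)

## Evaluate both kernel density estimates on the same grid so the
## two curves can be subtracted point by point.
d1 <- density(x1, from = 0, to = 10, n = 512)
d2 <- density(x2, from = 0, to = 10, n = 512)

## Signed difference: positive where sample 2 is denser than sample 1.
dens_df <- data.frame(x = d1$x, ydiff = d2$y - d1$y)
dens_df$side <- dens_df$ydiff > 0

p_signed <- ggplot(dens_df, aes(x = x, y = ydiff)) +
    geom_hline(yintercept = 0, colour = "grey50") +
    geom_line() +
    ylab("density(sample 2) - density(sample 1)")
print(p_signed)
```

Keeping the sign shows directly which sample dominates in which region, at the cost of a panel that dips below zero.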

Source: [3] SO answer 3


Since the y scales are different, it would probably be best to plot the two graphs in a single column. – Gregor 2015-04-01 01:07:30


@Gregor Like this? – jhin 2015-04-01 01:57:17


Yeah! Now you don't have to fiddle with the scale of the difference. – Gregor 2015-04-01 02:04:27