2017-10-22 84 views
0

我有一个时间序列数据集,从1992-2017年。我可以为整个数据点设置颜色,但我想要的是为特定年份范围设置所需的颜色。例如;从1992 - 1995年的“蓝色”,从1995年到2005年的“红色”等。我们该怎么做?如何在一个图中为特定年份值范围指定不同的颜色? (Python)

数据集有2列;年和价值。

import numpy as np 
import pandas as pd 
from scipy import stats 
from sklearn import linear_model 
from matplotlib import pyplot as plt 
import pylab 
import matplotlib.patches as mpatches 
import matplotlib.pyplot as plt 
import seaborn as sns 
from sklearn.linear_model import LinearRegression 

Atlantic = pd.read_csv('C:\\AtlanticEnd.csv', error_bad_lines=False) 

X = Atlantic['year'] 

y = Atlantic['Poseidon'] 

plt.figure(figsize=(20,10)) 
plt.ylabel('Change in mean sea level [mm]', fontsize=20) 
plt.xlabel('Years', fontsize=20) 
plt.title('Atlantic Ocean - Mean Sea Level', fontsize=20) 
colors = ["blue", "red", "green", "purple"] 
texts = ["Poseidon", "Jason1", "Jason2", "Jason3"] 
patches = [ plt.plot([],[], marker="o", ms=10, ls="", mec=None, color=colors[i], 
      label="{:s}".format(texts[i]))[0] for i in range(len(texts)) ] 
plt.legend(handles=patches, loc='upper left', ncol=1, facecolor="grey", numpoints=1) 

plt.plot(X, y, 'ro', color='red') 

slope, intercept, r_value, p_value, std_err = stats.linregress(X, y) 
plt.plot(X, X*slope+intercept, 'b') 

plt.axis([1992, 2018, -25, 80]) 

plt.grid(True) 

plt.show() 

def trendline(Atlantic, order=1): 
    coeffs = np.polyfit(Atlantic.index.values, list(Atlantic), order) 
    slope = coeffs[-2] 
    return float(slope) 

slope = trendline(y) 
print(slope) 

enter image description here

+1

欢迎SO。提供样本数据并告诉我们您做了什么:[最小,完整和可验证示例](https://stackoverflow.com/help/mcve) – skrubber

+0

添加了代码和输出图片。 –

回答

0

我做我自己的随机数据,此功能工作,但假设你有不重叠的日期范围,这应该工作。这也好像你的日期不是pd.datetime类型。这应该适用于pd.datetime类型,但字典中的查找值将类似于("1992-01-01","2000-01-01")等。

# Create data 
data = np.random.rand(260,1) 
dates = np.array(list(range(1992,2018))*10) 

df = pd.DataFrame({"y":data[:,0],"date":dates}) 
df = df.sort(columns="date") 

# Dictionary lookup 
lookup_dict = {(1992,2000):"r", (2001,2006):"b",(2007,2018):"k"} 

# Slice data and plot 
fig, ax = plt.subplots() 
for lrange in lookup_dict: 
    temp = df[(df.date>=lrange[0]) & (df.date<=lrange[1])] 
    ax.plot(temp.date,temp.y,color=lookup_dict[lrange], marker="o",ls="none") 

这将产生:

enter image description here

0

我可以想像,使用颜色表为点的散点图可以是一个简单的解决方案。假设年份以十进制格式给出,那么分散的颜色将仅由年份来定义。 A BoundaryNorm将定义值的范围,并且可以从颜色列表容易地创建颜色映射。

import numpy as np 
import pandas as pd 
import matplotlib.pyplot as plt 
import matplotlib.colors 

y = np.random.rand(300)*26+1992 
d = (3.075*(y-1992)-17)+np.random.normal(0,5,300) 
df = pd.DataFrame({"year" : y, "data" : d}) 

bounds = [1992,1995,2005,2015,2018] 
colors = ["darkorchid", "crimson", "limegreen", "gold"] 
cmap = matplotlib.colors.ListedColormap(colors) 
norm = matplotlib.colors.BoundaryNorm(bounds, len(colors)) 

fig, ax = plt.subplots() 
sc = ax.scatter(df.year, df.data, c=df.year.values, cmap=cmap, norm=norm) 
fig.colorbar(sc, spacing="proportional") 

fit = np.polyfit(df.year.values, df.data.values, deg=1) 
ax.plot(df.year, np.poly1d(fit)(df.year.values), color="k") 

plt.show() 

enter image description here

相关问题