2014-12-04 58 views
6

我必须绘制并保存从1到500的循环与不同的数据,但由于Matplot lib导致内存泄漏。 有人有任何想法如何处理?Matplotlib简单大小写内存泄漏与熊猫

简单情况下这里:

import sys 
import gc 
import numpy as np 
import matplotlib 
matplotlib.use('Agg') 
import matplotlib.pyplot as plt 
import pandas as pd 

pdindex = pd.date_range(start='01/01/2013', freq='15min', end='01/01/2019') 
df = pd.DataFrame({'test':np.random.normal(0,1,len(pdindex))}, index=pdindex) 


def memplot_plot(df, i): 
    df.test.plot()  
    plt.title('graph' + str(i)) 
    plt.savefig(str(i) + '.png', dpi=144) 
    plt.close() 

for i in range(1, 100): 
    print '*******************************' 
    print 'i : ' + str(i)  
    print len(gc.get_objects()) 
    print sys.getsizeof(gc.get_objects()) 
    memplot_plot(df, i)  
    gc.collect() 

并且输出是(存储器错误,因为我的= 6):

******************************* 
i : 1 
74682 
325680 
******************************* 
i : 2 
290627 
1190248 
******************************* 
i : 3 
506420 
2145012 
******************************* 
i : 4 
721993 
3054204 
******************************* 
i : 5 
937566 
3865524 
******************************* 
i : 6 
1153139 
4892352 
Traceback (most recent call last): 
    File "<stdin>", line 1, in <module> 
    File "C:\Anaconda\lib\site- packages\spyderlib\widgets\externalshell\sitecustomize.py", line 580, in runfile 
    execfile(filename, namespace) 
    File "C:/PERSO/script_backtesting.py", line 124, in <module> 
    memplot_plot(df, i)  
    File "C:/PERSO/script_backtesting.py", line 107, in memplot_plot 
    plt.savefig(str(i) + '.png', dpi=144) 
    File "C:\Anaconda\lib\site-packages\matplotlib\pyplot.py", line 576, in savefig 
    res = fig.savefig(*args, **kwargs) 
    File "C:\Anaconda\lib\site-packages\matplotlib\figure.py", line 1470, in savefig 
    self.canvas.print_figure(*args, **kwargs) 
    File "C:\Anaconda\lib\site-packages\matplotlib\backend_bases.py", line 2192, in  print_figure 
    **kwargs) 
    File "C:\Anaconda\lib\site-packages\matplotlib\backends\backend_agg.py", line 513, in print_png 
    FigureCanvasAgg.draw(self) 
    File "C:\Anaconda\lib\site-packages\matplotlib\backends\backend_agg.py", line 461, in  draw 
    self.figure.draw(self.renderer) 
    File "C:\Anaconda\lib\site-packages\matplotlib\artist.py", line 59, in draw_wrapper 
    draw(artist, renderer, *args, **kwargs) 
    File "C:\Anaconda\lib\site-packages\matplotlib\figure.py", line 1079, in draw 
    func(*args) 
    File "C:\Anaconda\lib\site-packages\matplotlib\artist.py", line 59, in draw_wrapper 
    draw(artist, renderer, *args, **kwargs) 
    File "C:\Anaconda\lib\site-packages\matplotlib\axes\_base.py", line 2092, in draw 
    a.draw(renderer) 
    File "C:\Anaconda\lib\site-packages\matplotlib\artist.py", line 59, in draw_wrapper 
    draw(artist, renderer, *args, **kwargs) 
    File "C:\Anaconda\lib\site-packages\matplotlib\axis.py", line 1103, in draw 
    ticks_to_draw = self._update_ticks(renderer) 
    File "C:\Anaconda\lib\site-packages\matplotlib\axis.py", line 957, in _update_ticks 
    tick_tups = [t for t in self.iter_ticks()] 
    File "C:\Anaconda\lib\site-packages\matplotlib\axis.py", line 903, in iter_ticks 
    self.major.formatter.set_locs(majorLocs) 
    File "C:\Anaconda\lib\site-packages\pandas\tseries\converter.py", line 982, in set_locs 
    self._set_default_format(vmin, vmax) 
    File "C:\Anaconda\lib\site-packages\pandas\tseries\converter.py", line 966, in _set_default_format 
    format = np.compress(info['maj'], info) 
    File "C:\Anaconda\lib\site-packages\numpy\core\fromnumeric.py", line 1563, in compress 
    return compress(condition, axis, out) 
MemoryError 

图的存储器,用于从发射计算机脚本breack的和杀死控制台。 enter image description here

+0

我能够在Window 7 64-bit,pandas 0.15.1和matplotlib 1.4.0上重现这一点。如果您重新配置绘图,以便设置一个图形和坐标轴并将其传递到绘图('df.test.plot(ax = ax)'),而不是执行plt.close(),则只需执行轴清除了'ax.cla()',内存仍然爬起来,但速度并不快。除此之外,我不确定发生了什么。 – Ajean 2014-12-04 17:26:16

+4

此问题似乎是无关紧要的,因为它是一个错误报告。适当时,您应该用matplotlib或pandas提交错误。 – 2014-12-04 18:46:49

+0

此代码适用于我的Mac,无泄漏内存......非常奇怪! – 2014-12-04 21:00:02

回答

5

这显然是一个错误。如果这个问题支左右,如下修改代码消除我的计算机上的内存泄漏:

import sys 
import gc 
import numpy as np 
import matplotlib 
matplotlib.use('Agg') 
import matplotlib.pyplot as plt 
import pandas as pd 

pdindex = pd.date_range(start='01/01/2013', freq='15min', end='01/01/2019') 
df = pd.DataFrame({'test':np.linspace(0,1,len(pdindex))}, index=pdindex) 

fig, ax = plt.subplots() 

def memplot_plot(df, i, f, a): 
    a.plot(df.index, df.test) 
    a.set_title('graph' + str(i)) 
    f.savefig(str(i) + '.png', dpi=144) 
    a.cla() 

for i in range(1, 100): 
    print '*******************************' 
    print 'i : ' + str(i)  
    print len(gc.get_objects()) 
    print sys.getsizeof(gc.get_objects()) 
    memplot_plot(df, i, fig, ax)  
    gc.collect() 

输出现在看起来是这样的:

... 
******************************* 
i : 13 
83727 
732816 
******************************* 
i : 14 
83727 
732816 
******************************* 
i : 15 
83727 
732816 

...等

+0

它在我的PC上运行良好!Thx。因此,熊猫的错误​​,我会提交。 – 2014-12-04 21:49:32