2016-11-25 80 views
0

已阅读所有关于此的线程,但我仍然遇到了死路。试图将所有csv放在一个目录中,并将它们添加为一张新的xlsx工作簿。下面是我得到了什么:脚本将多个csv合并为单个xslx不起作用

import xlwt, csv, os, glob 

def make_excel_workbook(path): 
    wb = xlwt.Workbook() 
    for filename in os.listdir(folder_path): 
     if filename.endswith('.csv'): 
      ws = wb.add_sheet(os.path.splitext(filename)[0]) 
      with open('{}\\{}'.format(folder_path, filename), 'rb') as csvfile: 
       reader = csv.reader(csvfile, delimiter=',') 
       for rowx, row in enumerate(reader): 
        for colx, value in enumerate(row): 
         ws.write(rowx, colx, value) 
    return wb 

csvDir = "C:\\Temp\\Data\\outfiles" 
outDir = "C:\\Temp\\Data\\output" 

os.chdir(csvDir) 
csvFileList = [] 
searchTerm = "character string" 

for file in glob.glob('*.csv'): 
    csvFileList.append(file) 

for i in csvFileList: # search a set of extant csv files for a string and make new csv files filtered on the search term 
    csv_file = csv.reader(open(i, 'rb'), delimiter=',') 
    rowList = [] 
    for row in csv_file: 
     for field in row: 
      if searchTerm in field: 
       rowList.append(row) 
    outputCsvFile = os.path.join(rootDir, i) 
    with open(outputCsvFile, 'wb') as newCsvFile: 
     wr = csv.writer(newCsvFile, quoting=csv.QUOTE_ALL) 
     wr.writerows(rowList) 

到目前为止,它的工作原理,并从原来的大得多,那些创建新的CSV文件。在此处,它打破:

if __name__ == '__main__': 
    xls = make_excel_workbook(outDir) 
    xls_name = "My_Team_Tasks" 
    xls.save('{}\\{}{}.'format(outDir, xls_name, '.xls')) 
    print('{}\\{}{} saved successfully'.format(outDir, xls_name, '.xls')) 

当它到达xls.save,它给了我下面的错误:

更新:这里是整个回溯:

Traceback (most recent call last): 
    File"M:/Testing/scripts/csv_parse.py", line 44, in <module> 
     xls.save('{}\\{}{}'.format(rootDir, xls_name, '.xls')) 
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\Workbook.py", line 696, in save 
     doc.save(filename_or_stream, self.get_biff_data()) 
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\Workbook.py", line 660, in get_biff_data 
     shared_str_table = self.__sst_rec() 
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\Workbook.py", line 662, in __sst_rec 
     return self.__sst.get_biff_record() 
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\BIFFRecords.py", line 77, in get_biff_record 
     self._add_to_sst(s) 
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\BIFFRecords.py", line 92, in _add_to_sst 
     u_str = upack2(s, self.encoding) 
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\UnicodeUtils.py", line 50, in upack2 
     us = unicode(s, encoding) 
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 69: ordinal not in range (128) 
+0

你能否包含整个堆栈跟踪? – danielx

+0

好像你正在使用python3并打开文件为二进制文件,只需在所有打开的语句中将''rb''更改为''r''。 – Dalvenjia

+0

@丹尼尔斯,包括追踪。 – auslander

回答

0

你知道输入CSV文件如何编码?它从错误消息看来是unicode?

你可以试试:

wb = xlwt.Workbook(encoding='utf-8') 

做不到这一点,按照这个答案(xlwt module - saving xls unicode error)似乎另一种可能的方法来解决这个问题写出来之前,你的文字转换成Unicode编码。

ws.write(rowx, colx, value.decode('utf-8')) 

同样,这取决于您的输入是如何编码的。

+0

Quichao先生,第二个建议,把行写成UTF-8,完美无缺!我现在正在得到我期望的输出。非常感谢。 – auslander