2012-01-05 58 views
-2

我想提取文本文件的某些部分。我的输入文件:如何从文件中提取某些文本?

-- num cell port function safe [ccell disval rslt] 
    "17 (BC_1, CLK, input, X)," & 
    "16 (BC_1, OC_NEG, input, X), " &-- Merged input/ 
    " 8 (BC_1, D(8), input, X)," & -- cell 16 @ 1 -> Hi-Z 
    " 7 (BC_1, Q(1), output3, X, 16, 1, Z)," & 
    " 0 (BC_1, Q(8), output3, X, 16, 1, Z)"; 

,我需要的输出是这样:

num cell port function safe ccell 
    17 BC_1 CLK input X 
    16 BC_1 OC_NEG input X 
    16 BC_1 * control 1 
    8 BC_1 D8 input X 
    7 BC_1 Q1 output3 X 16 1 
    0 BC_1 Q8 output3 X 16 1 

到目前为止,我尝试下面的代码,但它给了指数的错误。请建议。

import re 
lines=open("input.txt",'r').readlines() 

for line in lines: 
    a=re.findall(r'\w+',line) 
    print re.findall(r'\w+',line) 
    print a[0],a[1],a[2],a[3],a[4],a[5],a[6] 

我使用的Python 2.6.6和错误如下:

['num', 'cell', 'port', 'function', 'safe', 'ccell', 'disval', 'rslt'] 
num cell port function safe ccell disval 
['17', 'BC_1', 'CLK', 'input', 'X'] 
17 BC_1 CLK input X 
Traceback (most recent call last): 
    File "C:\Users\ctee1\Desktop\pyparsing\outputparser.py", line 39, in <module> 
    print a[0],a[1],a[2],a[3],a[4],a[5],a[6] 
IndexError: list index out of range 
+0

什么语言是这个呢?我们能否得到确切的异常信息的副本? – Dessus 2012-01-05 03:54:15

+0

可能是python? – 2012-01-05 04:00:38

回答

0
sed -r 's/(disval|rslt)//; s/(.)--.*/\1/; s/[^[:alnum:]_]//; s/ +$//' 
+0

是的,我在Python 2.6.6上运行,错误如下: – maximus 2012-01-05 05:00:34