2013-08-23 60 views
0
>>> with open('Book2.csv', 'rb') as f: 
    inpreader = csv.reader(f, delimiter=',') 
    for row in inpreader: 
     print ','.join(row) 

IdNo, skillsList 
BAXA0000206_NEENA_TIWARI_0.htm,[u'Training', u'E-Learning', u'PowerPoint', u'Teaching', u'Accounting', u'Team Management', u'Team Building', u'Microsoft Excel', u'Microsoft Office', u'Financial Accounting', u'Microsoft Word', u'Customer Service'] 
BAXA0000227_ABDUR_RAZZAQUE_0.htm,[u'Telecommunications', u'Data Center', u'ISO 27001', u'Management', u'BS25999', u'Technology', u'Information Technology...', u'Certified PMP\xae', u'Certified BS25999 Lead...'] 
BAXA0000261_Priya _ Lobo_0.htm,[u'Market Research', u'Segmentation', u'Marketing Strategy', u'Consumer Behavior', u'Experience Working with...'] 

我想找出skillsList每个技能的词频..我该怎么办呢?通过列表中的项目分组

感谢

+0

哪里频率字段? –

+0

skillsList。我想要每个技能的术语频率。例如,培训:45,电子学习:50 – user1946217

+0

CSV文件的外观如何? (我想你现在正在显示你的代码的输出,而不是输入) –

回答

0
counter = {} 
for skill_list in lists: 
    for skill in skill_list: 
     counter[skill] = counter.get(skill, 0) + 1 

print counter 
+0

我这样做,没有工作。你能否纠正我错误的地方? >>>具有开放( 'Book2.csv', 'RB')为f: \t inpreader = csv.reader(F,分隔符= ' ') \t用于inpreader行: \t \t解释=',' 。加入(行) \t \t >>> LEN(列表) >>>计数器= {} >>>用于解释skill_list: \t为技术人员在skill_list: \t \t计数器[技能] = counter.get(skill,0)+ 1 \t \t >>> print counter {',':1} – user1946217

+0

counter = {} with open('Book2.csv','rb')as f: inpreader = csv.reader(f,delimiter =',' ) 用于inpreader行: 用于行技能[1]: 计数器[技能] = counter.get(技能,0)+ 1个 打印 '' 加入(行) 打印计数器 [链接] (http://simonsagi.com/sites/stackoverflow/frequency.txt) –