2017-07-03 198 views
1

我试图将生成的列表导出到csv文件,其中网站表中的每一行对应于文件中的新行,并且每个值都位于单个单元格中,例如:将python列表值写入csv文件

NAME.....ICO DATE....ICO PRICE....CURR. PRICE....24 HR ROI Stratis.....06/20/16.......$0.007...........$7.480................+38.80%

电流输出看起来是这样的:

['Patientory\n05/31/17\n$0.104\n$0.274\n+46.11%\n+25.54%\nN/A']

import csv 
from selenium import webdriver 
from selenium.webdriver.common.by import By 
from selenium.webdriver.support import expected_conditions as EC 
from selenium.webdriver.support.ui import WebDriverWait as wait 

csvrows = [] 

def get_css_sel(selector): 
    posts = browser.find_elements_by_css_selector(selector) 
    for post in posts: 
     print(post.text) 
     csvrows.append([post.text]) 

browser = webdriver.Chrome(executable_path=r'C:\Scrapers\chromedriver.exe') 
browser.get("https://icostats.com") 
wait(browser, 20).until(EC.presence_of_element_located((By.CSS_SELECTOR, "#app > div > div.container-0-16 > div.table-0-20 > div.tbody-0-21 > div:nth-child(2) > div:nth-child(8)"))) 

get_css_sel("#app > div > div.container-0-16 > div.table-0-20 > div.tableheader-0-50")    #fetch header of table 
get_css_sel("#app > div > div.container-0-16 > div.table-0-20 > div.tbody-0-21 > div")    #fetch rows of table 

def create_csv(thelist): 
    with open('ICO.csv', 'w') as myfile: 
     for i in thelist: 
      wr = csv.writer(myfile, quoting=csv.QUOTE_ALL) 
      wr.writerow([i]) 

create_csv(csvrows) 

回答

2

get_css_sel(),每个post.text包含换行符\n分开行文字 - 与您的examp输出。所以追加[post.text]附加一个列表与单个项目的整个行。修改成:

csvrows.append(post.text.split('\n')) # remove the extra list brackets 
             # since split returns a list. 

例:

>>> y = 'Patientory\n05/31/17\n$0.104\n$0.274\n+46.11%\n+25.54%\nN/A' 
>>> y.split('\n') 
['Patientory', '05/31/17', '$0.104', '$0.274', '+46.11%', '+25.54%', 'N/A'] 

此外,在你的写作圈,你不应该重新创建csv.writer的每一行,遍历thelist之前就去做一次。

由于您拥有csvrows中的所有行,因此您可以直接使用csvwriter.writerows

def create_csv(thelist): 
    with open('ICO.csv', 'w') as myfile: 
     wr = csv.writer(myfile, quoting=csv.QUOTE_ALL) 
     wr.writerows(thelist) 
+0

这就是它!另外,如何删除引号?我无法调用remove():AttributeError:'NoneType'对象没有属性'remove' – tklein

+0

你试图删除哪些引号?在CSV中,您已将'quoting = csv.QUOTE_ALL'。删除,如果你不想要不必要的引号。此外,[默认的'dialect'是'excel'](https://docs.python.org/3/library/csv.html#csv.writer)通常就足够了。 – aneroid

+0

如果你得到空行并且没有'post.text'的文本,那么在'csvrows.append ...'之前放置'if post.text:'。 – aneroid

1

试试这个代码:

import csv 
from selenium import webdriver 
from selenium.webdriver.common.by import By 
from selenium.webdriver.support import expected_conditions as EC 
from selenium.webdriver.support.ui import WebDriverWait as wait 

csvrows = [] 
def get_css_sel(selector): 
    posts = browser.find_elements_by_css_selector(selector) 
    for post in posts: 
     print(post.text) 
     csvrows.append(post.text) 

browser = webdriver.Chrome(executable_path=r'//Users/Pranavtadepalli/Downloads/chromedriver') 
browser.get("https://icostats.com") 
wait(browser, 20).until(EC.presence_of_element_located((By.CSS_SELECTOR, "#app > div > div.container-0-16 > div.table-0-20 > div.tbody-0-21 > div:nth-child(2) > div:nth-child(8)"))) 

get_css_sel("#app > div > div.container-0-16 > div.table-0-20 > div.tableheader-0-50")    #fetch header of table 
get_css_sel("#app > div > div.container-0-16 > div.table-0-20 > div.tbody-0-21 > div")    #fetch rows of table 
new=[",".join(elem.split("\n")) for elem in csvrows] 
newfile=open("csvfile.csv",'r') 
newfile1=open("csvfile.csv",'w') 
newstuff=newfile.read() 
for elem in new: 
    newfile1.write(elem+'\n') 
newfile1.close() 
newfile.close()