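pandas read_csv: building an index from a list

I need to read a CSV file that I created by copying and pasting some data from Wikipedia. The data is a list of university towns grouped by state. What I want is to load this data into a pandas DataFrame whose index is the state name. However, when I import the file with read_csv, the data is one-dimensional: the state names end up in the same column as the town and university names.

From this DataFrame I now need to extract the states from the first column and use them as the index, and I don't know how to do that. I could try a for/if loop over a list of state names, but there is probably a faster and more elegant way. Any suggestions?

This is what the CSV file looks like: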
Alabama[edit]
Auburn (Auburn University, Edward Via College of Osteopathic Medicine)[14]
Birmingham (University of Alabama at Birmingham, Birmingham School of Law, Cumberland School of Law, Miles Law School)[15]
Dothan (Fortis College, Troy University Dothan Campus, Alabama College of Osteopathic Medicine)
Florence (University of North Alabama)
Homewood (Samford University)
Huntsville (University of Alabama, Huntsville)
Jacksonville (Jacksonville State University)[16]
Livingston (University of West Alabama)[16]
Mobile (University of South Alabama)[17]
Montevallo (University of Montevallo, Faulkner University)[16]
Montgomery (Alabama State University, Huntingdon College, Auburn University at Montgomery, H. Councill Trenholm State Technical College, Faulkner University)
Troy (Troy University)[16]
Tuscaloosa (University of Alabama, Stillman College, Shelton State)[18][19]
Tuskegee (Tuskegee University)[20]
Alaska[edit]
Anchorage[21] (University of Alaska Anchorage)
Fairbanks (University of Alaska Fairbanks)[16]
Juneau (University of Alaska Southeast)
Ketchikan (University of Alaska Southeast-extended campus)
Sitka (University of Alaska Southeast-extended campus)
Thanks a lot!
Can you post a sample of the data? – MedAli
Just pasted part of the data – Jemba88
Thanks @ayhan! I worked it out from there – Jemba88
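For anyone landing here later, one loop-free way to do this is sketched below. It is not necessarily the approach referred to in the comments. It assumes that every state header ends in "[edit]", that each town entry sits on a single line, and that footnote markers look like "[14]". The column names `raw`, `State`, `Town` and `Universities` are made up for illustration, and the inline `raw` string stands in for the real file (in practice you would read the lines from the file instead).

```python
import io
import pandas as pd

# A few lines copied from the sample in the question, standing in for the real file.
raw = """Alabama[edit]
Auburn (Auburn University, Edward Via College of Osteopathic Medicine)[14]
Florence (University of North Alabama)
Alaska[edit]
Anchorage[21] (University of Alaska Anchorage)
Fairbanks (University of Alaska Fairbanks)[16]
"""

# Read one record per line; the entries contain commas, so splitting on "," would break them.
lines = [line.strip() for line in io.StringIO(raw) if line.strip()]
df = pd.DataFrame({"raw": lines})

# Assumption: a row is a state header exactly when it ends with "[edit]".
is_state = df["raw"].str.endswith("[edit]")

# Keep the state name on header rows only, then forward-fill it down to the town rows.
df["State"] = (
    df["raw"].where(is_state).str.replace("[edit]", "", regex=False).ffill()
)

# Drop the header rows and strip footnote markers such as "[14]".
towns = df.loc[~is_state]
entries = towns["raw"].str.replace(r"\[\d+\]", "", regex=True).str.strip()

# Split "Town (University, University)" into two named columns.
parts = entries.str.extract(r"^(?P<Town>.*?)\s*\((?P<Universities>.*)\)$")
parts["State"] = towns["State"]

result = parts.set_index("State")
print(result)
```

The key step is `where(...)` followed by `ffill()`: the state name is kept only on the header rows and then propagated to every row beneath it until the next header, which replaces the for/if loop. If an entry is wrapped across two lines in the file, it would need to be joined before this step.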