2012-01-06 68 views
0

我有2个excel文件,我已经转换成列表。第一个文件有我需要的所有项目的完整列表。但是,第二个列表中有一小部分需要在第一个列表中更改的项目。 这是我的第一个名单是如何构建的:构建如何将2个LINQ词典合并为1?

IEnumerable<ExcelRow> queryListA = from d in datapullList 
              select new ExcelRow 
              { 
               Company = d.GetString(0), 
               Location = d.GetString(1), 
               ItemPrice = d.GetString(4), 
               SQL_Ticker = d.GetString(15) 
              }; 

第二届名单中非常相似的方式:

IEnumerable<ExcelRow> queryListB = from dupes in dupespullList 
              select new ExcelRow 
              { 
               Company = d.GetString(0), 
               Location = d.GetString(1), 
               NewCompany = d.GetString(4) 
              }; 

所以,如果有一家公司从1日列表中的特定位置匹配第二个清单,那么公司将被更改为新的公司名称。

然后,我的最终列表应该包含第一列表中的所有内容,但是第二列表中指定的更改应该包含在内。

我一直在为此奋斗了几天。让我知道你是否需要更多细节。

[更新:]我很新LINQ和C#。我在网上找到了关于Office 2003的Excel阅读器的代码。如何从以下所有类创建1列表(如上所述)? 我ExcelRow类:

class ExcelRow 
{  
    List<object> columns; 

    public ExcelRow() 
    { 
     columns = new List<object>(); 
    } 

    internal void AddColumn(object value) 
    { 
     columns.Add(value); 
    } 

    public object this[int index] 
    { 
     get { return columns[index]; } 
    } 

    public string GetString(int index) 
    { 
     if (columns[index] is DBNull) 
     { 
      return null; 
     } 
     return columns[index].ToString(); 
    } 

    public int Count 
    { 
     get { return this.columns.Count; } 
    } 
} 

我ExcelProvider类:

class ExcelProvider : IEnumerable<ExcelRow> 
{ 
    private string sheetName; 
    private string filePath; 
    private string columnName1; 
    private string columnName2; 
    private List<ExcelRow> rows; 

    public ExcelProvider() 
    { 
     rows = new List<ExcelRow>(); 
    } 

    public static ExcelProvider Create(string filePath, string sheetName, string columnName1, string columnName2) 
    { 
     ExcelProvider provider = new ExcelProvider(); 
     provider.sheetName = sheetName; 
     provider.filePath = filePath; 
     provider.columnName1 = columnName1; 
     provider.columnName2 = columnName2; 
     return provider; 
    } 

    private void Load() 
    {    
     string connectionString = @"Provider=Microsoft.Jet.OLEDB.4.0;Data Source={0};Extended Properties= ""Excel 8.0;HDR=YES;IMEX=1""";    
     connectionString = string.Format(connectionString, filePath); 
     rows.Clear(); 
     using (OleDbConnection conn = new OleDbConnection(connectionString)) 
     { 
      try 
      { 
       conn.Open(); 
       using (OleDbCommand cmd = conn.CreateCommand()) 
       { 
        cmd.CommandText = string.Format("SELECT * FROM [{0}$] WHERE {1} IS NOT NULL AND {2} <> \"{3}\"", sheetName, columnName1, columnName2, null); 
        using (OleDbDataReader reader = cmd.ExecuteReader()) 
        { 
         while (reader.Read()) 
         { 
          ExcelRow newRow = new ExcelRow(); 
          for (int count = 0; count < reader.FieldCount; count++) 
          { 
           newRow.AddColumn(reader[count]); 
          } 
          rows.Add(newRow); 
         } 
        } 
       } 
      } 
      catch (Exception ex) 
      { throw ex; } 
      finally 
      { 
       if (conn.State == System.Data.ConnectionState.Open) 
        conn.Close(); 
      } 
     } 
    } 

    public IEnumerator<ExcelRow> GetEnumerator() 
    { 
     Load(); 
     return rows.GetEnumerator(); 
    } 

    System.Collections.IEnumerator System.Collections.IEnumerable.GetEnumerator() 
    { 
     Load(); 
     return rows.GetEnumerator(); 
    } 
} 

所以,使用所有这样的逻辑,我怎么能解决我的问题?

回答

1
//first create a dictionary of comapny whose name has been changed 
    var dict = queryListB.ToDictionary(x => x.Company, y => y.NewCompany); 

    //loop on the first list and do the changes in the first list 
    queryListA.ForEach(x => 
         { 
          if(dict.Keys.Contains(x.Company)) 
           x.Company = dict[x.Company]; 
         }); 
+0

我觉得OP是要求在位置匹配,以及:“所以,如果有一个公司从一个特定的位置** **在符合第二个1日列表列表,然后公司变更为新公司名称“ – Joey 2012-01-06 09:52:41

+0

是的,这正是我所需要的。感谢@Joey的澄清。 – 2012-01-06 14:11:55

0

我敢肯定,你可以写简单的代码来达到同样的目标,但我已经为减少您必须通过第一和第二列表迭代次数的方式。如果性能不是问题,则只需在duapupList中搜索datapullList中的每个元素即可。

var excelRowCreator = new ExcelRowCreator(dupespullList); 
var finalRows = excelRowCreator.CreateExcelRows(datapullList); 

// ... 

public class ExcelRowCreator 
{ 
    /// <summary> 
    /// First key is company name, second is location 
    /// and final value is the replacement name. 
    /// </summary> 
    private readonly IDictionary<string, IDictionary<string, string>> nameReplacements; 

    /// <summary> 
    /// I don't know what type of objects your initial 
    /// lists contain so replace T with the correct type. 
    /// </summary> 
    public ExcelRowCreator(IEnumerable<T> replacementRows) 
    { 
     nameReplacements = CreateReplacementDictionary(replacementRows); 
    } 

    /// <summary> 
    /// Creates ExcelRows by replacing company name where appropriate. 
    /// </summary> 
    public IEnumerable<ExcelRow> CreateExcelRows(IEnumerable<T> inputRows) 
    { 
     // ToList is here so that if you iterate over the collection 
     // multiple times it doesn't create new excel rows each time 
     return inputRows.Select(CreateExcelRow).ToList(); 
    } 

    /// <summary> 
    /// Creates an excel row from the input data replacing 
    /// the company name if required. 
    /// </summary> 
    private ExcelRow CreateExcelRow(T data) 
    { 
     var name = data.GetString(0); 
     var location = data.GetString(1); 

     IDictionary<string, string> replacementDictionary; 
     if (nameReplacements.TryGetValue(name, out replacementDictionary)) 
     { 
      string replacementName; 
      if (replacementDictionary.TryGetValue(location, out replacementName)) 
      { 
       name = replacementName; 
      } 
     } 

     return new ExcelRow 
     { 
      Company = name, 
      Location = location, 
      ItemPrice = data.GetString(4), 
      SQL_Ticker = data.GetString(15) 
     }; 
    } 

    /// <summary> 
    /// A helper method to create the replacement dictionary. 
    /// </summary> 
    private static IDictionary<string, IDictionary<string, string>> CreateReplacementDictionary(IEnumerable<T> replacementRows) 
    { 
     var replacementDictionary = new Dictionary<string, IDictionary<string, string>>(); 
     foreach (var dupe in replacementRows) 
     { 
      var name = dupe.GetString(0); 
      IDictionary<string, string> locationReplacements; 
      if (!replacementDictionary.TryGetValue(name, out locationReplacements)) 
      { 
       locationReplacements = new Dictionary<string, string>(); 
       replacementDictionary[name] = locationReplacements; 
      } 

      locationReplacements[dupe.GetString(1)] = dupe.GetString(4); 
     } 

     return replacementDictionary; 
    } 
} 

UPDATE:包装为一类,并写在Visual Studio所以不应该有任何语法错误。

+0

忽略我的答案片刻,刚注意到公司要求更改名称的位置和名称相同。更新我的解决方案,以便检查它。 – Joey 2012-01-06 09:06:55

+0

答复已更新。 – Joey 2012-01-06 09:55:20

1

循环遍历queryListA并查看queryListB中是否有匹配的公司。如果是,则更新Company属性。

下面的代码:

foreach (var companyA in queryListA) 
{ 
    var companyBMatch = queryListB.FirstOrDefault(x => x.Company == companyA.Company && x.Location == companyA.Location); 
    if (companyBMatch != null) 
    companyA.Company = companyBMatch.NewCompany; 
} 
+0

这是否仍然保留我列表中的所有其他项目,还是需要再次构建它们? – 2012-01-09 21:46:02

+0

是的,它仍然保持其他项目。 – Paul 2012-01-10 02:44:54

+0

这是一个新手问题。我怎样才能从这个循环中创建一个列表? – 2012-01-15 21:48:33