2017-03-01 79 views
1

我想使用Spring批处理基于属性值解析xml,下面是XML供参考。使用Spring批量解析基于属性值的xml片段

<?xml version="1.0" encoding="UTF-8"?> 
<customerInfo> 
<cutommer dept="IT"> 
<param value="Jane" name="first-name"/> 
<param value="Doe" name="last-name"/> 
<param value="17 Streets" name="address"/> 
<param value="1234567" name="phone-number"/> 
</customer> 
<cutommer dept="ES"> 
<param value="Jane" name="first-name"/> 
<param value="Doe" name="last-name"/> 
<param value="17 Streets" name="address"/> 
<param value="1234567" name="phone-number"/> 
</customer> 
</customerInfo> 

基础上上面的XML要来解析客户标签,其部门属性附加伤害值是“IT”。任何帮助是appriciated

更新1:

@Configuration 
@EnableBatchProcessing 
public class ControllerInfoParser_Config extends DefaultBatchConfigurer { 

    @Autowired 
    private JobBuilderFactory jobs; 
    @Autowired 
    private StepBuilderFactory steps; 
    @Bean 
    public Job parseComponentInfoXML(Step parseComponentInfo,Step partitionStep, CustomJobExecutionerListener customJobExecutionerListener) 
      throws UnexpectedInputException, ParseException, Exception { 

     return jobs.get("parseComponentInfoXML").listener(customJobExecutionerListener).start(parseComponentInfo) 
       .next(partitionStep).build(); 


    } 
    @Bean 
    public Step parseComponentInfo(ItemReader<Customer> oneDeptITItemReader) throws UnexpectedInputException, ParseException, Exception { 

     return steps.get("parseComponentInfo").<Customer, Customer> chunk(1) 
       .reader(componentInfoReader()).reader(oneDeptITItemReader).processor(componentInfoProcessor()) 
       .writer(componentInfoWriter()).build(); 
    } 

    @Bean 
    public ItemReader<Customer> componentInfoReader() throws UnexpectedInputException, ParseException, Exception { 

     //OneDeptITItemReader <Customer> reader1 = new OneDeptITItemReader<Customer>(); 
     StaxEventItemReader<Customer> reader = new StaxEventItemReader<Customer>(); 
     reader.setResource(new ClassPathResource("xml//customer.xml")); 
     reader.setFragmentRootElementName("customer"); 

     Jaxb2Marshaller marshaller = new org.springframework.oxm.jaxb.Jaxb2Marshaller(); 
     marshaller.setClassesToBeBound(Customer.class); 

     // marshaller.setSchema(new ClassPathResource("xml//company.xsd")); 

     reader.setUnmarshaller(marshaller); 

     return reader; 
    } 

    @Bean 
    public ItemReader<Customer> oneDeptITItemReader(ItemReader<Customer> ir) { 
     OneDeptITItemReader<Customer> odIR = new OneDeptITItemReader<Customer>(); 
     odIR.setDelegate(ir); 
     return odIR; 
    } 
    @Bean 
    public ItemProcessor<Customer, Customer> componentInfoProcessor() { 

     return new CustomerProcessor(); 
    } 

    @Bean 
    public ItemWriter<Object> componentInfoWriter() { 

     return new SqlWritter(); 
    } 
} 

public class OneDeptITItemReader <T> implements ItemReader <Customer>{ 

    ItemReader<Customer> delegate; 

     public ItemReader<Customer> getDelegate() { 
     return delegate; 
    } 

    public void setDelegate(ItemReader<Customer> delegate) { 
     this.delegate = delegate; 
    } 

    @Override 
    public Customer read() { 
     boolean read = true; 
     Customer item = null; 
     while(read) { 
      try { 
      item = delegate.read(); 
     } catch (Exception e) { 
      // TODO Auto-generated catch block 
      e.printStackTrace(); 
      read =false; 
     } 
     read = !"IT".equals(item.getDept()); 
     } 
     return item; 
     } 

} 
+0

不要专注于阅读,但在过程阶段:用自定义'ItemProcessor中<客户,客户>'返回null部门<>“IT”或返回对象本身,如果部门是等于“IT” –

+0

感谢Luca提供的建议,早些时候我考虑过这种方法,但是我的XML文件在15 MB左右会很大,并且它只包含一个dept属性值为“IT”的片段,剩下的数千个客户片段将不必要的解析并到达ItemProcessor 。一旦我们得到IT部门的客户片段以避免不必要的资源消耗,是否有办法阻止进一步的批处理流程? –

回答

0

停止阅读是这样的从ItemReader.read()返回null
编写一个自定义ItemReader委托,并在找到“IT”部门后停止阅读。

class OneDeptITItemReader implements ItemReader<Customer> { 
    ItemReader<Customer> delegate; 

    @Override 
    public Customer read() { 
    boolean read = true; 
    while(read) { 
    Customer item = delegate.read(); 
    read = read != null && !"IT".equals(item.getDept()); 
    } 
    return item; 
    } 
} 

使用委托时,您必须将委派的读者注册为流,以让SB管理其生命周期。 请参阅6.5 The Delegate Pattern and Registering with the Step

+0

感谢卢卡分享片段,我试着上面的代码,但调用delegate.read()时得到空指针异常。 –

+0

我想你在调用我的代码片段之前设置了一个有效的委托。 –

+0

我已经提出了实现细节请参阅更新1:让我知道是否有任何事情需要改变。 –

0

下面的代码段适用于我。

停止阅读的方法是从ItemReader.read()返回null。 编写一个自定义ItemReader委托,并在找到“IT”部门后停止阅读。

class OneDeptITItemReader implements ItemReader<Customer> { 
StaxEventItemReader<Customer> delegate; 

public void setDelegate(StaxEventItemReader<Customer> delegate) { 
     this.delegate = delegate; 
    } 

    @Override 
    public Customer read() { 
    boolean read = true; 
    delegate.open(new ExecutionContext()); 
    Customer item = null; 
    while(read) { 
    item = delegate.read(); 
    read = item != null && !"IT".equals(item.getDept()); 
    } 
    return item; 
    } 
}