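Parsing XML fragments based on an attribute value using Spring Batch
I want to use Spring Batch to parse an XML file based on an attribute value. The XML is shown below for reference.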
<?xml version="1.0" encoding="UTF-8"?>
<customerInfo>
<customer dept="IT">
<param value="Jane" name="first-name"/>
<param value="Doe" name="last-name"/>
<param value="17 Streets" name="address"/>
<param value="1234567" name="phone-number"/>
</customer>
<customer dept="ES">
<param value="Jane" name="first-name"/>
<param value="Doe" name="last-name"/>
<param value="17 Streets" name="address"/>
<param value="1234567" name="phone-number"/>
</customer>
</customerInfo>
Given the XML above, I want to parse only the customer elements whose dept attribute value is "IT". Any help is appreciated.
Update 1:
import org.springframework.batch.core.Job;
import org.springframework.batch.core.Step;
import org.springframework.batch.core.configuration.annotation.DefaultBatchConfigurer;
import org.springframework.batch.core.configuration.annotation.EnableBatchProcessing;
import org.springframework.batch.core.configuration.annotation.JobBuilderFactory;
import org.springframework.batch.core.configuration.annotation.StepBuilderFactory;
import org.springframework.batch.item.ItemProcessor;
import org.springframework.batch.item.ItemReader;
import org.springframework.batch.item.ItemWriter;
import org.springframework.batch.item.xml.StaxEventItemReader;
import org.springframework.beans.factory.annotation.Autowired;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;
import org.springframework.core.io.ClassPathResource;
import org.springframework.oxm.jaxb.Jaxb2Marshaller;

@Configuration
@EnableBatchProcessing
public class ControllerInfoParser_Config extends DefaultBatchConfigurer {

    @Autowired
    private JobBuilderFactory jobs;

    @Autowired
    private StepBuilderFactory steps;

    @Bean
    public Job parseComponentInfoXML(Step parseComponentInfo, Step partitionStep,
            CustomJobExecutionerListener customJobExecutionerListener) {
        return jobs.get("parseComponentInfoXML")
                .listener(customJobExecutionerListener)
                .start(parseComponentInfo)
                .next(partitionStep)
                .build();
    }

    @Bean
    public Step parseComponentInfo(ItemReader<Customer> oneDeptITItemReader) {
        return steps.get("parseComponentInfo")
                .<Customer, Customer>chunk(1)
                // Only the filtering reader is set; it wraps componentInfoReader().
                .reader(oneDeptITItemReader)
                .processor(componentInfoProcessor())
                .writer(componentInfoWriter())
                // The wrapper is not an ItemStream, so the delegate is registered
                // explicitly; otherwise the StaxEventItemReader is never opened.
                .stream(componentInfoReader())
                .build();
    }

    @Bean
    public StaxEventItemReader<Customer> componentInfoReader() {
        StaxEventItemReader<Customer> reader = new StaxEventItemReader<Customer>();
        reader.setResource(new ClassPathResource("xml/customer.xml"));
        reader.setFragmentRootElementName("customer");
        Jaxb2Marshaller marshaller = new Jaxb2Marshaller();
        marshaller.setClassesToBeBound(Customer.class);
        // marshaller.setSchema(new ClassPathResource("xml/company.xsd"));
        reader.setUnmarshaller(marshaller);
        return reader;
    }

    @Bean
    public ItemReader<Customer> oneDeptITItemReader(ItemReader<Customer> ir) {
        OneDeptITItemReader<Customer> odIR = new OneDeptITItemReader<Customer>();
        odIR.setDelegate(ir);
        return odIR;
    }

    @Bean
    public ItemProcessor<Customer, Customer> componentInfoProcessor() {
        return new CustomerProcessor();
    }

    @Bean
    public ItemWriter<Object> componentInfoWriter() {
        return new SqlWritter();
    }
}

import org.springframework.batch.item.ItemReader;

public class OneDeptITItemReader<T> implements ItemReader<Customer> {

    private ItemReader<Customer> delegate;

    public ItemReader<Customer> getDelegate() {
        return delegate;
    }

    public void setDelegate(ItemReader<Customer> delegate) {
        this.delegate = delegate;
    }

    @Override
    public Customer read() throws Exception {
        Customer item;
        // Skip fragments until one with dept="IT" is found or the input is exhausted.
        // The null check avoids a NullPointerException at the end of the file.
        do {
            item = delegate.read();
        } while (item != null && !"IT".equals(item.getDept()));
        // A null return tells Spring Batch that there is no more input.
        return item;
    }
}
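Don't focus on the reading phase but on the processing phase: use a custom 'ItemProcessor<Customer, Customer>' that returns null if dept is not "IT", or returns the object itself if dept equals "IT".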
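The processor-based filtering suggested in this comment could look roughly like the sketch below. It is only an illustration: the nested `ItemProcessor` interface is a stand-in with the same shape as Spring Batch's, `Customer` is a hypothetical minimal class carrying only `dept`, and `filter` merely imitates how a chunk-oriented step drops items for which the processor returns null.

```java
import java.util.ArrayList;
import java.util.List;

public class DeptFilterSketch {

    /** Stand-in with the same shape as Spring Batch's ItemProcessor<I, O>. */
    public interface ItemProcessor<I, O> {
        O process(I item) throws Exception;
    }

    /** Hypothetical, minimal Customer carrying only the dept attribute. */
    public static class Customer {
        private final String dept;

        public Customer(String dept) {
            this.dept = dept;
        }

        public String getDept() {
            return dept;
        }
    }

    /** Passes "IT" customers through and returns null for all others. */
    public static class ItDeptProcessor implements ItemProcessor<Customer, Customer> {
        @Override
        public Customer process(Customer item) {
            return "IT".equals(item.getDept()) ? item : null;
        }
    }

    /** Imitates the framework's handling: items processed to null are not written. */
    public static List<Customer> filter(List<Customer> items) throws Exception {
        ItemProcessor<Customer, Customer> processor = new ItDeptProcessor();
        List<Customer> kept = new ArrayList<>();
        for (Customer c : items) {
            Customer result = processor.process(c);
            if (result != null) {
                kept.add(result);
            }
        }
        return kept;
    }
}
```

In a real job, only a class like `ItDeptProcessor` (implementing the framework's own interface) would be needed; every fragment is still read and unmarshalled before it reaches the processor.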
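Thanks for the suggestion, Luca. I considered this approach earlier, but my XML file will be large (around 15 MB) and contains only one fragment whose dept attribute value is "IT". The remaining thousands of customer fragments would be parsed unnecessarily and passed to the ItemProcessor. Is there a way to stop the rest of the batch flow once the IT customer fragment has been found, to avoid unnecessary resource consumption?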
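One possible direction for stopping early, sketched here with the JDK's plain StAX API rather than Spring Batch itself: a streaming parser only consumes as much of the document as it is asked for, so returning as soon as the first matching customer has been closed leaves the rest of the file unparsed. Within Spring Batch, the equivalent idea would be a reader that returns null after the first match, since a null from `read()` signals the end of the input. The class name `FirstDeptFragmentFinder`, the method `findFirst`, and the choice of a `Map` as the result type are assumptions made for this example.

```java
import java.io.Reader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

public class FirstDeptFragmentFinder {

    /**
     * Returns the name/value pairs of the first customer element whose dept
     * attribute equals the given value, or null if there is no such element.
     */
    public static Map<String, String> findFirst(Reader xml, String dept)
            throws XMLStreamException {
        XMLStreamReader reader = XMLInputFactory.newInstance().createXMLStreamReader(xml);
        try {
            Map<String, String> params = null; // non-null only inside a matching customer
            while (reader.hasNext()) {
                int event = reader.next();
                if (event == XMLStreamConstants.START_ELEMENT) {
                    String name = reader.getLocalName();
                    if ("customer".equals(name)
                            && dept.equals(reader.getAttributeValue(null, "dept"))) {
                        params = new LinkedHashMap<>();
                    } else if ("param".equals(name) && params != null) {
                        params.put(reader.getAttributeValue(null, "name"),
                                reader.getAttributeValue(null, "value"));
                    }
                } else if (event == XMLStreamConstants.END_ELEMENT
                        && "customer".equals(reader.getLocalName())
                        && params != null) {
                    // First match is complete: stop without reading the remaining fragments.
                    return params;
                }
            }
            return null; // no customer with the requested dept
        } finally {
            reader.close();
        }
    }
}
```

Non-matching customers are still scanned token by token, but they are never unmarshalled into objects, which is usually the more expensive part.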