2016-05-16 65 views
1

我试图使用JavaScript分割函数来标记以下句子。如何使用javascript令牌化句子

CHRIS NISWANDEE, 
    (SMALLSYS INC, 
    795 E DRAGRAM), 
    TUCSON AZ 85705, 
    USA 

我预期的结果是,

"chris","niswnadee",",","(","smallsys","inc","785","e","dgram","("... 
etc 

林能够在使用下面的代码字boundries分裂,

"CHRIS NISWANDEE, (SMALLSYS INC, 795 E DRAGRAM), TUCSON AZ 85705, USA".split(/\b\s+/) 

有没有什么办法可以让那些逗号和parenthesizes在我结果?

回答

3

似乎你想分开/\s+|\b/也许。

这意味着: “空白的任何序列(\s+|)任何字边界(\b)”

"CHRIS NISWANDEE, (SMALLSYS INC, 795 E DRAGRAM), TUCSON AZ 85705, USA".split(/\s|\b/) 

输出

["CHRIS", "NISWANDEE", ",", "(", "SMALLSYS", "INC", ",", "795", "E", "DRAGRAM", "),", "TUCSON", "AZ", "85705", ",", "USA"]