2017-08-28 125 views
-1

Converting tabular CLI output to JSON format in Python

I need to convert the output below to JSON format in Python.

How can I do this?

switch# sh mod 
Mod  Ports  Module-Type                          Model               Status
---  -----  -----------------------------------  ------------------  ----------
1    48     1/2/4/8 Gbps FC/Supervisor-3         DS-C9148-K9-SUP     active *

Mod  Sw              Hw      World-Wide-Name(s) (WWN)
---  --------------  ------  --------------------------------------------------
1    6.2(17)         1.1     20:01:54:7f:ee:df:88:f8 to 20:30:54:7f:ee:df:88:f8


Mod  MAC-Address(es)                         Serial-Num
---  --------------------------------------  ----------
1    c0-8c-60-65-82-dc to c0-8c-60-65-82-df  JAF1736ALLM

Input 1: https://i.stack.imgur.com/EGsY4.jpg

Input 2: https://i.stack.imgur.com/aDGcB.jpg

+3

1. What should the output look like, and 2. what have you tried that did not work? –

+1

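I would say you have to use either a complex regular expression or a stateful line parser. Unfortunately, both will land somewhere between challenging and ugly. –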

Answers

0

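You can use the '---' separator line to define slices into each key line and value line, and build the key-value pairs from those slices. (From your example, I am guessing there can be multiple "Mod" rows, each with a unique Mod value, so I used that field as the overall accumulator key.)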

from collections import defaultdict 
import re 
from itertools import groupby 

sample = """\
Mod  Ports  Module-Type                          Model               Status
---  -----  -----------------------------------  ------------------  ----------
1    48     1/2/4/8 Gbps FC/Supervisor-3         DS-C9148-K9-SUP     active *
2    48     1/2/4/8 Gbps FC/Supervisor-3         DS-C9148-K9-SUP     active *

Mod  Sw              Hw      World-Wide-Name(s) (WWN)
---  --------------  ------  --------------------------------------------------
1    6.2(17)         1.1     20:01:54:7f:ee:df:88:f8 to 20:30:54:7f:ee:df:88:f8
2    6.2(17)         1.1     20:01:54:7f:ee:df:88:f8 to 20:30:54:7f:ee:df:88:f8

Mod  MAC-Address(es)                         Serial-Num
---  --------------------------------------  ----------
1    c0-8c-60-65-82-dc to c0-8c-60-65-82-df  JAF1736ALLM
2    c0-8c-60-65-82-ec to c0-8c-60-65-82-ef  JAF1736AXXX

Xbar  Ports  Module-Type  Model  Status
----  -----  -----------  -----  ------
1     0      Fabric 1     ABC    ok

Xbar  Sw  Hw
----  --  ---
1     NA  1.0

"""

all_input_lines = sample.splitlines() 
mod_accum = defaultdict(dict) 
xbar_accum = defaultdict(dict) 

# split the input into groups of consecutive blank / non-blank lines
for is_blank, input_lines_iter in groupby(all_input_lines,
                                          key=lambda s: not bool(s.strip())):
    input_lines = list(input_lines_iter)
    if is_blank:
        continue

    # assume first two lines are field names and separator dashes
    names, dashes = input_lines[:2]

    # make sure dashes line is all '---' separators
    if not all(ss == set('-') for ss in map(set, dashes.split())):
        print("invalid line group found, skipping...")
        print('-'*40)
        print('\n'.join(input_lines))
        print('-'*40)
        continue

    # use regex to get start/end of each '---' divider, and make slices
    spans = (match.span() for match in re.finditer('-+', dashes))
    slices = [slice(sp[0], sp[1]+1) for sp in spans]

    names = [names[sl].rstrip() for sl in slices]

    # is this a module or an xbar?
    if 'Mod' in names:
        key = 'Mod'
        accum = mod_accum
    elif 'Xbar' in names:
        key = 'Xbar'
        accum = xbar_accum
    else:
        raise ValueError("no Mod or Xbar name in row names ({})".format(
            ",".join(names)))

    # skip the names and dashes lines; only the remaining lines hold data
    for line in input_lines[2:]:
        # use slices to extract data from values, make into a dict
        row_dict = dict(zip(names, (line[sl].rstrip() for sl in slices)))

        # accumulate these values into any previous ones collected for this Mod
        accum[row_dict[key]].update(row_dict)

# print out what we got 
import json 
all_data = {"Modules": mod_accum, "Xbars": xbar_accum} 
print(json.dumps(all_data, indent=2)) 

Prints (key order may differ depending on the Python version):

{
  "Modules": {
    "2": {
      "World-Wide-Name(s) (WWN)": "20:01:54:7f:ee:df:88:f8 to 20:30:54:7f:ee:df:88:f8",
      "Module-Type": "1/2/4/8 Gbps FC/Supervisor-3",
      "Ports": "48",
      "Sw": "6.2(17)",
      "Hw": "1.1",
      "Model": "DS-C9148-K9-SUP",
      "Status": "active *",
      "Serial-Num": "JAF1736AXXX",
      "MAC-Address(es)": "c0-8c-60-65-82-ec to c0-8c-60-65-82-ef",
      "Mod": "2"
    },
    "1": {
      "World-Wide-Name(s) (WWN)": "20:01:54:7f:ee:df:88:f8 to 20:30:54:7f:ee:df:88:f8",
      "Module-Type": "1/2/4/8 Gbps FC/Supervisor-3",
      "Ports": "48",
      "Sw": "6.2(17)",
      "Hw": "1.1",
      "Model": "DS-C9148-K9-SUP",
      "Status": "active *",
      "Serial-Num": "JAF1736ALLM",
      "MAC-Address(es)": "c0-8c-60-65-82-dc to c0-8c-60-65-82-df",
      "Mod": "1"
    }
  },
  "Xbars": {
    "1": {
      "Module-Type": "Fabric 1",
      "Ports": "0",
      "Sw": "NA",
      "Hw": "1.0",
      "Model": "ABC",
      "Status": "ok",
      "Xbar": "1"
    }
  }
}
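
As a side note, the slicing idea can be tried in isolation on a single table. The following is only a minimal sketch: the helper name `parse_table` and the three-line sample are made up for illustration, and it uses a small variation in which each column runs from the start of its dashes to the start of the next column, with the last column left open-ended. It assumes every value starts in the same column as its dashes.

```python
import re

def parse_table(lines):
    """Parse one header/dashes/data group into a list of row dicts.

    Hypothetical helper: assumes lines[0] is the header, lines[1] is the
    dashes line, and all remaining lines are data rows.
    """
    header, dashes, *rows = lines
    # column start offsets come from the runs of dashes
    starts = [m.start() for m in re.finditer(r"-+", dashes)]
    # each column ends where the next one starts; the last one is open-ended
    ends = starts[1:] + [None]
    slices = [slice(a, b) for a, b in zip(starts, ends)]
    names = [header[sl].strip() for sl in slices]
    return [dict(zip(names, (row[sl].strip() for sl in slices))) for row in rows]

table = [
    "Xbar  Sw  Hw",
    "----  --  ---",
    "1     NA  1.0",
]
parsed = parse_table(table)
print(parsed)
```

Leaving the last slice open-ended means a final value that is wider than its dashes is not cut off, at the cost of also picking up any trailing text on the line.
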
+0

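Thanks for the suggestion, Paul. The code above works perfectly for one module. However, for input 1 it throws a KeyError, because there is a new row name 'Xbar'. Any idea how we can handle this? Also, it does not iterate over the next set of modules in input 2. – Aftab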

+0

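After writing it, I had a feeling this would turn out to be a multiple-module case. I rewrote it to use itertools.groupby to pull out groups of lines, with some error checking in case a group contains non-data lines. You will not learn a lot of Python from this, but maybe it will be a useful code sample for you. – PaulMcG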
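
For readers unfamiliar with itertools.groupby, here is a minimal sketch of just the grouping step: consecutive lines are grouped by whether they are blank, and the blank groups are dropped. The function name `split_groups` and the sample text are illustrative assumptions.

```python
from itertools import groupby

text = """\
Mod  Sw
---  -------
1    6.2(17)

Xbar  Sw
----  --
1     NA
"""

def split_groups(text):
    """Return the runs of consecutive non-blank lines in text."""
    groups = []
    # groupby starts a new group each time the key (blank or not) changes
    for is_blank, run in groupby(text.splitlines(), key=lambda s: not s.strip()):
        if not is_blank:
            groups.append(list(run))
    return groups

groups = split_groups(text)
for group in groups:
    print(group[0].split()[0], "table with", len(group) - 2, "data row(s)")
```

Each resulting group holds one table (header, dashes, data rows), so it can be parsed on its own without tracking state across the whole input.
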

1

I have a solution, but it is not pretty. Assuming your entire output is in text:

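import re

lines = [line.rstrip() for line in text.split("\n")]
# a key line is any line directly followed by a line of dashes
keylines = [line for i, line in enumerate(lines) if len(lines) > (i+1) and "---" in lines[i+1]]
# a value line is any line directly preceded by a line of dashes
vallines = [line for i, line in enumerate(lines) if i != 0 and "---" in lines[i-1]]
# fields are separated by two or more spaces, so join and split on that
keys = re.split(r" {2,}", "  ".join(keylines))
vals = re.split(r" {2,}", "  ".join(vallines))
result = dict(zip(keys, vals))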

Output:

{ 
    "Mod": "1", 
    "Ports": "48", 
    "Module-Type": "1/2/4/8 Gbps FC/Supervisor-3", 
    "Model": "DS-C9148-K9-SUP", 
    "Status": "active *", 
    "Sw": "6.2(17)", 
    "Hw": "1.1", 
    "World-Wide-Name(s) (WWN)": "20:01:54:7f:ee:df:88:f8 to 20:30:54:7f:ee:df:88:f8", 
    "MAC-Address(es)": "c0-8c-60-65-82-dc to c0-8c-60-65-82-df", 
    "Serial-Num": "JAF1736ALLM" 
} 

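It makes the following assumptions, and will break when they do not hold:

  • No value contains multiple consecutive spaces.
  • There are at least two spaces between "fields".
  • In each line of dashes, there is at least one segment of 3 dashes.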
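
If a table has more than one data row, the same split on two or more spaces can be applied line by line instead of to the joined text. This is only a sketch under the same spacing assumptions; the name `rows_from_text` and the sample text are hypothetical.

```python
import json
import re

def rows_from_text(text):
    """Yield one dict per data row, splitting fields on 2+ spaces."""
    lines = [line.rstrip() for line in text.split("\n")]
    keys = None
    for i, line in enumerate(lines):
        if not line:
            # a blank line ends the current table
            keys = None
        elif set(line.replace(" ", "")) == {"-"}:
            # a dashes line: the header is the line directly above it
            keys = re.split(r" {2,}", lines[i - 1]) if i > 0 else None
        elif keys is not None:
            # a data row of the current table
            yield dict(zip(keys, re.split(r" {2,}", line)))

sample_text = "\n".join([
    "Mod  Ports  Status",
    "---  -----  ----------",
    "1    48     active *",
    "2    48     ok",
    "",
])
rows = list(rows_from_text(sample_text))
print(json.dumps(rows, indent=2))
```

Rows from different tables that share the same Mod value would still need to be merged afterwards, for example by updating one dict per Mod value.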