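When designing a NiFi template, do we need to consider the underlying cluster? How should a NiFi template be designed for a cluster?
Here is my simple flow: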
+-----------------+ +---------------+ +-----------------+
| | | | | |
| READ FROM | | MERGE | | PUT HDFS |
| KAFKA | | FILES | | |
| +-----------------------> | +---------------------> | |
| | | | | |
| | | | | |
| | | | | |
+-----------------+ +---------------+ +-----------------+
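I have a 3-node cluster. When the system is running and I check the "Cluster" menu, only the primary node appears to be using resources; the other cluster nodes seem idle... The question is: in such a cluster, should I design the template around the cluster, or is load balancing supposed to be handled for me?
I saw that one of my colleagues created a remote process group for each node in the cluster and put a load balancer inside the template. Is this necessary? (See the diagram below.)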
+------------------+
| | +-------------+
| REMOTE PROCESS | | input port |
+----> | GROUP FOR | | (rpg) |
| | NODE 1 | +-------------+
| | | |
| | | |
| +------------------+ v
+-----------------+ +-----------------+ RPG
| | | | | +--------------+
| READ FROM | | | | | |
| KAFKA | | LOAD BALANCER | | +------------------+ | MERGE FILES |
| +-------------> | +-------------> | | | |
| | | | | | REMOTE PROCESS | | |
| | | | | | GROUP FOR | | |
| | | | | | NODE 2 | | |
+-----------------+ +-----------------+ RPG | | +--------------+
| +------------------+ |
| |
| v
|
| +-------------------+ +---------------+
| | | | |
| | REMOTE PROCESS | | PUT HDFS |
+-----> | GROUP FOR | | |
| NODE 3 | | |
| | | |
| | | |
+-------------------+ +---------------+
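What are the use cases for a load balancer? Apart from remote clusters, can I use a load balancer to split the traffic across several processors to speed up the operation?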
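To make the load-balancer step in the diagram concrete, here is a minimal conceptual sketch of round-robin distribution across three nodes. It is plain Python, not NiFi code: the function name `distribute_round_robin`, the node names, and the `msg-N` items are all hypothetical and only illustrate the idea of handing each incoming item to the next node in turn.

```python
from collections import defaultdict
from itertools import cycle


def distribute_round_robin(flowfiles, nodes):
    """Assign each item to the next node in turn (hypothetical helper)."""
    assignment = defaultdict(list)
    node_cycle = cycle(nodes)  # endlessly repeats node1, node2, node3, ...
    for flowfile in flowfiles:
        assignment[next(node_cycle)].append(flowfile)
    return dict(assignment)


# Assumed example data: three nodes and seven incoming messages.
nodes = ["node1", "node2", "node3"]
flowfiles = [f"msg-{i}" for i in range(7)]
result = distribute_round_robin(flowfiles, nodes)

for node in nodes:
    print(node, result[node])
```

With this kind of distribution every node gets a roughly equal share of the items, which is what I understand the load balancer in my colleague's template to be doing.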
"First, make sure ConsumeKafka is not scheduled to run on the primary node only." Yesss, that was the problem. Thanks. – ygk