我觉得这里最好的办法是用卡夫卡连接:link 但它是一个拉的方法: Kafka Connect sources are pull-based for a few reasons. First, although connectors should generally run continuously, making them pull-based means that the connector/Kafka Connect decides when data is actually pulled, which allows for things like pausing connectors without losing data, brief periods of unavailability as connectors are moved, etc. Second, in distributed mode the tasks that pull data may need to be rebalanced across workers, which means they won't have a consistent location or address. While in standalone mode you could guarantee a fixed network endpoint to work with (and point other services at), this doesn't work in distributed mode where tasks can be moving around between workers.
阿雯
我拉的基础卡夫卡连接方法的优点同意,但考虑到连接器会需要从依赖于客户端的数量多源拉。我们如何处理这样配置在连接器源凭据,频繁增加和客户的缺失等事物的来源平台的管理似乎是一个挑战。我们如何有效地处理这个问题? –