Total Yrs of Exp - 3 to 4 Yrs
Relv Yrs of Exp - 3+ Yrs. Roles:
Roles covers administration, monitoring, and support of Kafka brokers (including topics, partitions, consumer groups, and ACLs) and Zookeeper nodes. This service is considered critical and requires high availability and reliability to support enterprise applications and real-time data streaming pipelines. Key Responsibilities:
· Perform day-to-day monitoring and health checks for Kafka brokers, topics, partitions, consumer groups, and ACLs.
· Monitor Zookeeper nodes and ensure quorum availability.
· Respond to alerts, incidents, and escalations related to Kafka/Zookeeper.
· Troubleshoot issues related to message delivery, consumer lag, topic unavailability, or partition imbalance.
· Analyze recurring incidents and provide problem management inputs for permanent fixes.
· Configure and manage ACLs to enforce data security and access control.
· Maintain Zookeeper node configuration, monitor leader election, and resolve synchronization issues.
· Support deployment of new Kafka topics, partitions, ACL rules, and Zookeeper changes through the change process.
· Monitor Kafka performance metrics (throughput, latency, broker utilization).
· Ensure cluster scaling and partition rebalancing when required.
· Generate daily/weekly reports on Kafka/Zookeeper health, incidents, and SLA compliance.
Required Skills:
· Hands-on experience with Apache Kafka administration (brokers, topics, partitions, consumer groups, ACLs).
· Strong knowledge of Zookeeper architecture and operational maintenance. Familiarity with Kafka monitoring tools (Confluent Control Center, Grafana, Prometheus, Splunk, or equivalent).
· Good understanding of Kafka Connect, Streams, and Producers/Consumers.
· Basic Linux/Unix administration skills (shell scripting, system logs, process management)
Note - Resource needs to ready for F2F Intv at IBM location based on account request and Day 1 reporting from DOJ.
- IBMFG2JP00002549