On Site
Bangalore
India
3-6 months
Time and material
$ 22-25/Hr
Description
Location: Blore, Any location JD: Lead design and operational considerations for highly available, scalable, and resilient GenAI infrastructure and services. Define and implement site reliability engineering best practices across hybrid and multi-cloud environments. Partner with product, data, and AI engineering teams to ensure infrastructure meets performance, compliance, and security standards.Drive automation of provisioning, monitoring, and incident response to improve efficiency and reduce toil. Oversee observability, monitoring, alerting, and logging frameworks to ensure proactive issue identification and resolution. - IBMFG2JP00003921
Skills:
automation,site reliability engineering,operational considerations,hybrid and multi-cloud environments,monitoring,incident response,provisioning,design

Interested in this project and numerous others like it?

Register on WorkWall now and get started