About Docsumo:
Docsumo is a Document Workflow platform that converts unstructured documents (like bank statements, financials, policies) into structured, actionable data with the help of Agentic Workflows. We’re backed by Sequoia, Barclays, Fifth Wall, Common Ocean, and Techstars — and trusted by leading banks, insurers, and fintechs worldwide.
The opportunity as Senior DevOps / SRE Engineer:
We’re looking for a Senior SRE (Python) to lead a small team (2 engineers) and own the reliability, deployment, and automation of our AI platform. You’ll work hands-on with Kubernetes, GCP, AWS, Python (Flask/FastAPI) and ensure our infrastructure and applications run securely, reliably, and at scale.
Key Responsibilities:
- Lead SRE initiatives and mentor 2 junior engineers.
- Own deployments and monitoring across GCP (K8s, Cloud Run, VPC, networking) and AWS (Lambda, SES).
- Debug & fix issues in Python apps (Flask, FastAPI), with occasional Lua for canary deployments.
- Set up automation, infra-as-code, CI/CD pipelines, and incident response.
- Optimize for cost, performance, and reliability across infra and applications.
- Work closely with backend engineers, product, and operations to keep our services running smoothly.
Need to have:
- 4+ years in SRE/DevOps with strong Python scripting & backend debugging skills.
- Hands-on with Kubernetes, Docker, and cloud infra (GCP & AWS).
- Experience with MongoDB, Elastic, monitoring tools (Prometheus, Grafana).
- Strong troubleshooting, debugging, and problem-solving skills.
- Ability to lead small teams and drive reliability culture.
Nice to have:
- Experience with Temporal, Redis, or serverless (Cloud Run, Lambda).
- Exposure to high-traffic SaaS or AI/ML infrastructure.
- Prior team leadership/mentorship experience.
Why join us?
- Lead the SRE charter and shape reliability for our platform.
- Work on modern infra (K8s, Cloud-native, Temporal, serverless).
- High ownership, visible impact — report directly to Engineering leadership.
- Opportunity to grow into Principal Engineer / SRE Manager.
- Fast-paced startup, strong learning curve, and a collaborative culture.