CloudOps Engineer
BOS Framework
About BOS
BOS Framework is a Cloud infrastructure and DevOps automation platform that enables tech teams to provision, configure and orchestrate their application and data environments in AWS/Azure with built-in observability, resilience, and compliance, without having to learn IaC or DevOps on the job.
Creating Massive Impact
With BOS, tech-enabled businesses greatly reduce technical debt, assure on-going 99.99% uptime, gain release cycle efficiencies, and save 30 to 80% of the cost and time that goes into building, migrating, and maintaining Cloud environments with fewer tools and resources.
Roles & Responsibilities
- Python developer with good exposure on docker, managed kubernetes in AWS and Azure, terraform, ansible, open-source log exporters (+aggregators), open-source metrics exporters (+aggregators)
- RouteTable, Gateway, ACL, Route53, Certificate Management
- Amazon AWS
- Pipeline - Application (Build, Artifact Mgmt, Deployment, Report, Monitor, Listen), Infrastructure (Pull State, Synchronize, Destroy, Apply, Version, PushState)
- Running Python API, Service, Cron Jobs as a Docker container
- Identify, design, and deploy appropriate monitoring/alerting of service events related to performance, scalability, availability, and reliability
- Provide 24/7 support as part of an on-call rotation team that responds to afterhours infrastructure alerts and calls from the application support team requiring infrastructure support.
- Define and maintain critical cloud policies regarding Security, Service Level Agreements, Disaster Recovery, Shared Responsibility, Acceptable Use, Data Retention etc
- Powershell scripts, custom modules
- FluentD - .Net Core log exporters (approach to capture the console logs, no code changes in app but still logs are exported to aggregators like Elasticsearch)
- FluentD - capturing API stats from a .Net Core app
Requirements
- Bachelor's degree in Computer Science or related technical field, or equivalent practical experience
- At least 5 years of experience working on public clouds: AWS or Azure
- 2+ years experience providing site reliability engineering to a SaaS application
- Bachelor's degree in computer science, information technology, or a related field
- Experience with GCP or another cloud platform administration
- Must be passionate about trouble-shooting
- Experience with scripting languages such as Bash, Python, Powershell
- Knowledge of or desire to learn
- Containerization technologies such as Docker and Kubernetes
- Git, GitHub and related tools
- Application build pipeline tools such as Jenkins
Benefits
- 100% Company paid comprehensive medical insurance for you, your spouse, and children
- Paid time off
- Market competitive total compensation package
- Paid Maternity & Parental leave
- Your voice is heard; no matter your level, we're a team, all going in the same direction
Core Values
- Customer First: Putting Customers at the Heart: We place our clients at the forefront, responding to their needs with respect and efficency. Our growth is intertwined with our customers' success.
- Walk the Talk: Integrity in Action: Our words and actions align, fostering trust through transparency and long-term commitment. We embrace courage and honesty for the greater good.
- Team Spirit: Unity in Diversity: We champion collaboration across departments and locations, creating win-win situations and extending our team spirit to include our clients. Together, we find strength in unity.
- Excellence: Pursuit of Perfection: Our journey is marked by a relentless drive to surpass our acheivements, embracing each day as an oppurtunity to excel further.
- Drive Innovation: Innovative Mindset: We stay ahead of global tech trends, challenging the status quo with audacity and delivering cutting-edge solutions that drive growth.
- Outcome-Focused: Results-Driven Approach: We prioritize impactful solutions and maintain a balance between visionary objectives and immediate achievements, ensuring practicality in our pursuit of excellence.