Job Overview:
We are seeking an experienced Senior Cloud Engineer (AWS) with a strong focus on
infrastructure, maintenance, and operations. This role involves managing and
optimizing AWS cloud environments, ensuring availability, performance, and security
while supporting the organization's continuous operations. The ideal candidate will
have a solid background in DevOps practices and cloud infrastructure management,
with a focus on incident management, system reliability, and automation.
Key Responsibilities:
1. AWS Cloud Maintenance and Operations
Oversee daily operations of AWS environments, ensuring systems are
functional, secure, and optimized.
Perform patch management, backups, and system updates with minimal
disruption to services.
Monitor system performance and usage, addressing issues proactively to
minimize downtime.
2. Incident and Problem Management
Troubleshoot and resolve complex technical issues related to AWS
infrastructure and applications.
Act as an escalation point for operational incidents, driving root cause analysis
and permanent solutions.
Develop and maintain incident response and disaster recovery plans.
3. Infrastructure Optimization
Continuously evaluate and implement cost-saving measures, such as right-
sizing instances and leveraging reserved or spot instances.
Analyze performance metrics to identify opportunities for improvement in
availability and scalability.
4. Automation and DevOps
Automate routine operational tasks using Infrastructure as Code (IaC) tools
such as Terraform, CloudFormation, or CDK.
Build and maintain CI/CD pipelines to streamline deployments and minimize
manual intervention.
Implement monitoring and alerting solutions using AWS CloudWatch, ELK stack,
or similar tools.
5. Security and Compliance
Ensure adherence to security best practices, including IAM, VPC configuration,
and encryption.
Conduct regular security audits and remediations to meet compliance
standards like SOC 2, HIPAA, or ISO 27001.
6. Collaboration and Documentation
Work closely with development, operations, and product teams to support
ongoing cloud initiatives.
Maintain detailed documentation for infrastructure, processes, and incident
resolution.
Required Skills and Qualifications:
Experience:
6+ years of experience managing AWS cloud environments.
Proven experience in maintenance, operations, and incident management.
Technical Expertise:
Strong knowledge of AWS core services: EC2, S3, RDS, Lambda, CloudFront, and
Route 53.
Experience with IaC tools like Terraform, CloudFormation, or CDK.
Proficient in scripting languages (e.g., Python, Bash, or PowerShell).
Knowledge of containerization tools (Docker, ECS, or EKS).
Familiarity with DevOps tools such as Jenkins, GitLab CI/CD, or AWS
CodePipeline.
Preferred Qualifications:
AWS Certifications (e.g., Solutions Architect, SysOps Administrator).
Experience with multi-cloud environments.
Familiarity with tools like Ansible for configuration management
Interpersonal Skills:
Willingness and acumen to learn and adopt new technologies/enhancements
Positive attitude towards work and team
Ability to break complex problems into pieces and approach solutions.
Ability to accept new challenges in the role.
Ability to independently work on complex projects related to services with little
or no assistance from others.
Ability to understand the business requirements thoroughly and convert those
requirements into proper technical solutions to achieve the required goals.
Able to identify the gaps in the processes and propose appropriate changes
required for improvement.
Able to identify new opportunities for technological improvements and as well
optimally utilize the existing used features
