Senior DevOps Engineer – Remote
Responsibilities
Infrastructure as Code (IaC) & Automation
- Design, implement, and manage AWS infrastructure using Terraform.
- Automate provisioning and configuration management with Ansible.
- Maintain idempotent, reusable IaC for each environment (dev, staging, prod).
- Implement infrastructure changes via GitOps workflows (pull requests and approvals).
- Optimize cloud resources to balance cost and performance.
Cloud & Server Management (AWS)
- Set up and maintain EC2, EKS, DynamoDB, RDS (PostgreSQL), S3, VPC, IAM, and Route 53.
- Design secure networking (VPC, subnets, security groups, VPN, Route 53).
- Manage IAM roles, permissions, and policies following the principle of least privilege.
- Optimize AWS resources for cost efficiency (rightsizing, auto scaling, Reserved Instances, Savings Plans).
CI/CD Pipelines
- Design and maintain CI/CD pipelines using GitLab CI or Jenkins.
- Ensure zero-downtime deployments.
Database & Messaging System Management
- Deploy, configure, and optimize RDS PostgreSQL and DynamoDB.
- Implement backup and failover strategies.
- Fine-tune database query performance, caching, and connection pooling.
- Manage RabbitMQ clusters, queues, and message durability.
Monitoring & Observability
- Set up and manage Prometheus, Grafana, and Loki for monitoring and logging.
- Create dashboards and alerts for system health, performance, and failures.
- Use distributed tracing tools (e.g., OpenTelemetry) for debugging.
Security & Compliance Best Practices
- Implement IAM best practices (least privilege, MFA, role-based access control).
- Enforce network security (VPC isolation, security groups, WAF, private endpoints).
- Manage secrets and sensitive data securely via AWS Secrets Manager.
- Ensure automated security patching for OS and dependencies.
- Conduct regular security audits, vulnerability scans, and compliance checks.
Backup & Disaster Recovery
- Set up automated backups for databases and critical systems.
- Implement a disaster recovery plan that meets the defined RPO (Recovery Point Objective) and RTO (Recovery Time Objective) targets.
- Test disaster recovery plans with periodic simulations.
About the Company
Intetics Inc. is a leading American technology company specializing in custom software application development, the creation of distributed professional teams, software product quality assessment, and all-things-digital solutions.