Site Reliability Engineer

Zimperium

Zimperium

Software Engineering
Dallas, TX, USA
Posted on Jun 28, 2024
Duties:
Will define enterprise-wide applications development strategies to ensure that applications and
infrastructure that are brought to market meet customer requirements and are stable, reliable, and
production ready.
Duties include:
1. Design, code, test and deliver automation tools for production applications deployment and
maintenance;
2. Automate infrastructure creation and provisioning processes in AWS, Azure and OCI cloud
platforms using CloudFormation, Terraform, Ansible, Python and Shell scripting;
3. Develop applications using Java, Spring Boot, Datadog, Elasticsearch, ReactJS and Docker
Compose technologies for monitoring and alerting systems;
4. Enhance automation around configuration management, tooling and striving towards continuous
delivery of software;
5. Migrate docker compose and docker swarm style applications to Kubernetes using Helm Charts
involving working with Kafka, Postgres, RabbiMQ, ELK stack and Redis technologies;
6. Migrate applications between cloud platforms - AWS, Azure, OCI and GCP;
7. Deploy new versions of software releases to production environments and also work on
configuration change requests for production environments; and
8. Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure
of incidents on both public cloud and on premise environments.
Position Requirements:
Master’s Degree in Computer Science, Computer Engineering, Information Quality or a related field plus
1 year of experience in software development roles.
1 year of experience to include the following skills (all skills gained concurrently):
1) Experience with Java, Python and bash scripting languages;
2) Experience with Containerization technologies including Docker, Kubernetes and Swarm;
3) Experience with Terraform, CloudFormation, Ansible and Helm tools for automation;
4) Experience with creating CI/CD pipelines and jobs using Teamcity and Jenkins;
5) Experience with ELK stack, Kafka, RabbitMQ, SNS and SQS messaging technologies;
6) Experience with Postgres, MySQL, MongoDB, Redis and Memcached persistence technologies;
7) Experience with with gradle, maven, Github, Datadog, kibana, splunk, Swagger, Kong
dashboard, Postman and JIRA;
8) Experience with VPC, EC2, ELB, EKS, RDS, IAM, Kinesis, S3, Lambda, CloudFront and
CloudWatch services;
9) Experience with at least one of AWS, Azure or OCI cloud platforms; and
10) Experience with gathering & analyzing metrics using Datadog and to create monitors, synthetics
and dashboards for production applications and systems maintenance.