Easter Seals Jobs

Job Information

Calsoft Labs Inc. Site Reliability Engineer in Atlanta, Georgia

Title: Site Reliability Engineer
Duration: 12 Months
Location: Hybrid role in Atlanta, GA

As an engineer on the Retail Site Reliability Engineering team, you will be at the forefront of cloud and big data technology. In this role you will establish yourself as a technical leader by working with a broad range of industry-leading technologies that help drive acceleration. The ideal candidate will have expert design and development capabilities and be positioned to contribute to a growing set of services and features for the ecosystem. This role supports highly available, business-critical applications and serves as the escalation point for complex and hard-to-define issues in both on-premises and AWS environments.

We are seeking talented engineers who are well versed in DevOps technologies, automation, infrastructure orchestration, configuration management, continuous integration, and troubleshooting of complex issues, and who are not constrained by how things are usually done.

- Deep understanding of AWS services (Lambda, S3, SQS, IAM, Route 53, etc.) and proficiency in infrastructure as code (e.g., Terraform, CloudFormation).
- Hands-on experience with monitoring tools such as CloudWatch, Sumo Logic, Dynatrace, Grafana, or similar for application performance monitoring and alerting.
- Proficiency in scripting and automation (e.g., Python, Bash) to build and maintain deployment pipelines and infrastructure.
- Strong analytical and troubleshooting skills to diagnose and resolve complex infrastructure, application, and data issues.
- Experience with containerization (Docker, Kubernetes) and serverless architecture (AWS Lambda).

Required Skillset

- Manage and optimize data streaming and API components in on-premises OpenShift and AWS.
- Proactively review the application's APIs and processes to identify opportunities to optimize response times for various application components.
- Automate various types of testing, including data quality checks, and automate delivery and deployment to production.
- Develop integrations between the application, on-premises and in AWS, and our third-party tools (ServiceNow, VersionOne, Sumo Logic).
- Work with teams to create SLIs and SLOs.
- Actively monitor and lead troubleshooting of degraded performance and hard-to-define issues for the platform applications, develop the solution, and document artifacts from root cause analysis in the backlog.
