Hello,
We have an requirement for below position, kindly check and if interested then please share your updated copy of resume
Role: Site Reliability Engineer (SRE)
Location: Providence/ Rhode Island
Duration: Contract
Job Summary:
Seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to design, build, and maintain scalable, resilient, and high-performing IT infrastructure. The ideal candidate will have deep expertise in cloud and hybrid environments, automation, observability, and infrastructure operations, with a strong focus on minimizing downtime and improving service reliability.
Key Responsibilities:
1. Infrastructure Design & Implementation
Design and implement highly available and resilient infrastructure solutions.
Ensure system scalability, fault tolerance, and disaster recovery across on-premise, cloud, and hybrid environments.
Define and maintain architecture standards and best practices for infrastructure design.
2. Service Reliability & Performance
Establish and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
Ensure high availability and performance of critical systems and applications.
Collaborate with cross-functional teams to improve overall system reliability.
3. Incident & Problem Management
Proactively monitor and troubleshoot system and infrastructure issues to reduce Mean Time to Recovery (MTTR).
Lead Root Cause Analysis (RCA) for high-priority incidents (P1/P2) and drive preventive actions.
Maintain documentation for incident response procedures and recovery plans.
4. Automation & Infrastructure as Code (IaC)
Develop and implement Infrastructure-as-Code solutions using tools such as Terraform, Ansible, or similar.
Automate routine and repetitive operational tasks to reduce manual intervention and eliminate toil.
Manage version-controlled infrastructure and perform code reviews for IaC deployments.
5. Observability & Monitoring
Set up and manage monitoring and observability tools (e.g., Grafana, Prometheus, ELK stack, Azure Monitor).
Implement logging, metrics, and alerting to ensure visibility into the health and performance of systems.
Ensure early detection of anomalies and quick resolution of production issues.
6. DevOps & CI/CD Integration
Collaborate with DevOps teams to integrate CI/CD pipelines with infrastructure operations.
Support containerized environments using Docker and Kubernetes.
Maintain and optimize deployment processes for both infrastructure and applications.
7. Collaboration & Leadership
Act as a subject matter expert (SME) and provide guidance throughout the solution lifecycle.
Work closely with clients and stakeholders to understand business needs and design appropriate solutions.
Mentor junior engineers and participate in knowledge-sharing initiatives.
Required Skills & Qualifications:
Bachelor's degree in Computer Science, Information Technology, or a related field.
Proven experience managing and scaling IT infrastructure in cloud, on-premise, or hybrid environments.
Strong scripting/programming skills in Python, Bash, or PowerShell.
Proficiency in one or more cloud platforms: AWS, Azure, or GCP.
Expertise in automation tools: Terraform, Ansible, Chef, or similar.
Hands-on experience with observability and monitoring tools (e.g., Grafana, ELK, Prometheus).
Solid understanding of networking, virtualization, and storage technologies.
Familiarity with DevOps practices and container orchestration tools (e.g., Docker, Kubernetes).
Excellent troubleshooting, analytical, and problem-solving skills.
Thanks & Regards,
Rutuja Choudhari || US IT Recruiter
Nityo Infotech Corp
(609) 857-8238
rutuja.choudhari@nityo.com
If you feel you received this email by mistake or wish to unsubscribe, kindly reply to this email with "UNSUBSCRIBE" in the subject line
Hello,
We have an requirement for below position, kindly check and if interested then please share your updated copy of resume
Role: Site Reliability Engineer (SRE)
Location: Providence/ Rhode Island
Duration: Contract
Job Summary:
Seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to design, build, and maintain scalable, resilient, and high-performing IT infrastructure. The ideal candidate will have deep expertise in cloud and hybrid environments, automation, observability, and infrastructure operations, with a strong focus on minimizing downtime and improving service reliability.
Key Responsibilities:
1. Infrastructure Design & Implementation
Design and implement highly available and resilient infrastructure solutions.
Ensure system scalability, fault tolerance, and disaster recovery across on-premise, cloud, and hybrid environments.
Define and maintain architecture standards and best practices for infrastructure design.
2. Service Reliability & Performance
Establish and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
Ensure high availability and performance of critical systems and applications.
Collaborate with cross-functional teams to improve overall system reliability.
3. Incident & Problem Management
Proactively monitor and troubleshoot system and infrastructure issues to reduce Mean Time to Recovery (MTTR).
Lead Root Cause Analysis (RCA) for high-priority incidents (P1/P2) and drive preventive actions.
Maintain documentation for incident response procedures and recovery plans.
4. Automation & Infrastructure as Code (IaC)
Develop and implement Infrastructure-as-Code solutions using tools such as Terraform, Ansible, or similar.
Automate routine and repetitive operational tasks to reduce manual intervention and eliminate toil.
Manage version-controlled infrastructure and perform code reviews for IaC deployments.
5. Observability & Monitoring
Set up and manage monitoring and observability tools (e.g., Grafana, Prometheus, ELK stack, Azure Monitor).
Implement logging, metrics, and alerting to ensure visibility into the health and performance of systems.
Ensure early detection of anomalies and quick resolution of production issues.
6. DevOps & CI/CD Integration
Collaborate with DevOps teams to integrate CI/CD pipelines with infrastructure operations.
Support containerized environments using Docker and Kubernetes.
Maintain and optimize deployment processes for both infrastructure and applications.
7. Collaboration & Leadership
Act as a subject matter expert (SME) and provide guidance throughout the solution lifecycle.
Work closely with clients and stakeholders to understand business needs and design appropriate solutions.
Mentor junior engineers and participate in knowledge-sharing initiatives.
Required Skills & Qualifications:
Bachelor's degree in Computer Science, Information Technology, or a related field.
Proven experience managing and scaling IT infrastructure in cloud, on-premise, or hybrid environments.
Strong scripting/programming skills in Python, Bash, or PowerShell.
Proficiency in one or more cloud platforms: AWS, Azure, or GCP.
Expertise in automation tools: Terraform, Ansible, Chef, or similar.
Hands-on experience with observability and monitoring tools (e.g., Grafana, ELK, Prometheus).
Solid understanding of networking, virtualization, and storage technologies.
Familiarity with DevOps practices and container orchestration tools (e.g., Docker, Kubernetes).
Excellent troubleshooting, analytical, and problem-solving skills.
Thanks & Regards,
Rutuja Choudhari || US IT Recruiter
Nityo Infotech Corp
(609) 857-8238
rutuja.choudhari@nityo.com
If you feel you received this email by mistake or wish to unsubscribe, kindly reply to this email with "UNSUBSCRIBE" in the subject lin
...for a place where your efforts are not only appreciated but celebrated? Nexus is the place for you! We are looking for a full-desk recruiter to join our amazing team! Youll have the freedom to build and run your own desk with the support of a sage (who are you calling...
...The Safety Manager will oversee and coordinate all safety programs and initiatives for both public and private construction projects. This role ensures compliance with OSHA and MSHA regulations, implements best practices for site safety, and promotes a culture of safety...
...Background & Experience ~13 years of experience in management consulting, preferably at a top-tier firm. ~ An advanced degree or MBA from a top-ranked university. ~ Experience in the Life Sciences sector is a plus, but not required. ~ Adaptability and motivation...
...CANOPY CLEANING TEAM EMPLOYMENT TYPE: Part-time COMPENSATION: $20 per hour The... ...hiring a Cleaning Team Member to join our evening crew at our beautiful play space and... ...0 PM . Candidates must be available to work at least one weekend evening to be considered...
...professional and financial goals. Successful candidates will: Identify and cultivate relationships with real estate advisors/brokers, sponsors, and developers that can evolve into capital advisory opportunities. Manage the full deal cycle from outreach to...