
Remote
Full-Time
India
Skills
Test Automation
Continuous Integration and Continuous Delivery (CI/CD)
Infrastructure as code (IaC)
Configuration Management
Industry standards
About the Role
The LLMDevops/DevOps / CI/CD Engineer will be responsible for designing, implementing, and managing the CI/CD pipelines and infrastructure automation. This role involves collaborating with development, operations, and security teams to ensure smooth and efficient software delivery processes. The candidate will have a Bachelor's or Master's degree in Computer Science, Information Technology, or a related field and extensive experience in DevOps practices, CI/CD tools, and cloud platforms
Responsibilities:
Deploy, scale, and manage large language models (LLMs) in production environments, ensuring optimal resource usage and performance.
Design, implement, and manage CI/CD pipelines to automate the software delivery process, ensuring fast and reliable deployments.
Monitor and analyze model performance in real-time, addressing issues like model drift, latency, and accuracy degradation, and initiating model retraining or adjustments when necessary.
Manage cloud environments (AWS, GCP, Azure etc.,) to provision and scale infrastructure to meet the needs of training, fine-tuning, and inference for large models.
Collaborate with development teams to integrate CI/CD pipelines into the development workflow, promoting continuous integration and delivery best practices.
Implement infrastructure as code (IaC) using tools such as Terraform, Ansible, or CloudFormation to automate the provisioning and management of infrastructure.
Manage and maintain cloud infrastructure on platforms such as AWS, Azure, or Google Cloud, ensuring scalability, security, and reliability.
Develop and implement monitoring, logging, and alerting solutions to ensure the health and performance of applications and infrastructure.
Work closely with security teams to integrate security practices into the CI/CD pipelines, ensuring compliance with industry standards and regulations.
Optimize build and release processes to improve efficiency and reduce deployment times, implementing strategies such as parallel builds and incremental deployments.
Automate testing processes within the CI/CD pipelines to ensure high-quality software releases, including unit tests, integration tests, and performance tests.
Manage and monitor version control systems, such as Git, to ensure code integrity and facilitate collaboration among development teams.
Provide technical support and troubleshooting for CI/CD-related issues, ensuring timely resolution and minimal disruption to development workflows.
Develop and maintain documentation for CI/CD pipelines, infrastructure configurations, and best practices, ensuring clarity and accessibility for team members.
Stay updated on the latest trends and advancements in DevOps, CI/CD, and cloud computing, and incorporate new tools and practices into the organization's workflows.
Lead and participate in code reviews and technical discussions, providing insights and recommendations for continuous improvement.
Conduct training sessions and workshops for internal teams to promote knowledge sharing and best practices in DevOps and CI/CD.
Collaborate with IT and development teams to implement and manage containerization solutions using Docker and orchestration platforms such as Kubernetes.
Implement and manage configuration management solutions to maintain consistency and manage changes across environments.
Develop and implement disaster recovery and business continuity plans to ensure the resilience and availability of applications and infrastructure.
Optimize resource utilization and cost management for cloud infrastructure, implementing strategies such as auto-scaling and resource tagging.
Facilitate communication between development, operations, and business stakeholders to ensure alignment on DevOps goals and practices.
Participate in the evaluation and selection of DevOps tools and technologies that align with organizational goals and improve software delivery processes.
Manage and monitor application performance, implementing strategies to optimize performance and resolve bottlenecks.
Ensure compliance with organizational policies and industry regulations related to software development and deployment.
Requirements:
Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
Extensive experience in DevOps practices and CI/CD implementation.
Strong proficiency in CI/CD tools such as Jenkins, GitLab CI, or CircleCI.
Experience with cloud platforms such as AWS, Azure, or Google Cloud.
Proficiency in infrastructure as code (IaC) tools such as Terraform, Ansible, or CloudFormation.
Strong understanding of containerization and orchestration platforms such as Docker and Kubernetes.
Experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, ELK Stack, or Datadog.
Proficiency in scripting languages such as Python, Bash, or PowerShell.
Strong understanding of version control systems such as Git.
Excellent problem-solving and analytical skills, with the ability to troubleshoot and resolve technical issues.
Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams and stakeholders.
Certification in DevOps or cloud platforms (e.g., AWS Certified DevOps Engineer, Azure DevOps Engineer Expert) is preferred.
Responsibilities:
Deploy, scale, and manage large language models (LLMs) in production environments, ensuring optimal resource usage and performance.
Design, implement, and manage CI/CD pipelines to automate the software delivery process, ensuring fast and reliable deployments.
Monitor and analyze model performance in real-time, addressing issues like model drift, latency, and accuracy degradation, and initiating model retraining or adjustments when necessary.
Manage cloud environments (AWS, GCP, Azure etc.,) to provision and scale infrastructure to meet the needs of training, fine-tuning, and inference for large models.
Collaborate with development teams to integrate CI/CD pipelines into the development workflow, promoting continuous integration and delivery best practices.
Implement infrastructure as code (IaC) using tools such as Terraform, Ansible, or CloudFormation to automate the provisioning and management of infrastructure.
Manage and maintain cloud infrastructure on platforms such as AWS, Azure, or Google Cloud, ensuring scalability, security, and reliability.
Develop and implement monitoring, logging, and alerting solutions to ensure the health and performance of applications and infrastructure.
Work closely with security teams to integrate security practices into the CI/CD pipelines, ensuring compliance with industry standards and regulations.
Optimize build and release processes to improve efficiency and reduce deployment times, implementing strategies such as parallel builds and incremental deployments.
Automate testing processes within the CI/CD pipelines to ensure high-quality software releases, including unit tests, integration tests, and performance tests.
Manage and monitor version control systems, such as Git, to ensure code integrity and facilitate collaboration among development teams.
Provide technical support and troubleshooting for CI/CD-related issues, ensuring timely resolution and minimal disruption to development workflows.
Develop and maintain documentation for CI/CD pipelines, infrastructure configurations, and best practices, ensuring clarity and accessibility for team members.
Stay updated on the latest trends and advancements in DevOps, CI/CD, and cloud computing, and incorporate new tools and practices into the organization's workflows.
Lead and participate in code reviews and technical discussions, providing insights and recommendations for continuous improvement.
Conduct training sessions and workshops for internal teams to promote knowledge sharing and best practices in DevOps and CI/CD.
Collaborate with IT and development teams to implement and manage containerization solutions using Docker and orchestration platforms such as Kubernetes.
Implement and manage configuration management solutions to maintain consistency and manage changes across environments.
Develop and implement disaster recovery and business continuity plans to ensure the resilience and availability of applications and infrastructure.
Optimize resource utilization and cost management for cloud infrastructure, implementing strategies such as auto-scaling and resource tagging.
Facilitate communication between development, operations, and business stakeholders to ensure alignment on DevOps goals and practices.
Participate in the evaluation and selection of DevOps tools and technologies that align with organizational goals and improve software delivery processes.
Manage and monitor application performance, implementing strategies to optimize performance and resolve bottlenecks.
Ensure compliance with organizational policies and industry regulations related to software development and deployment.
Requirements:
Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
Extensive experience in DevOps practices and CI/CD implementation.
Strong proficiency in CI/CD tools such as Jenkins, GitLab CI, or CircleCI.
Experience with cloud platforms such as AWS, Azure, or Google Cloud.
Proficiency in infrastructure as code (IaC) tools such as Terraform, Ansible, or CloudFormation.
Strong understanding of containerization and orchestration platforms such as Docker and Kubernetes.
Experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, ELK Stack, or Datadog.
Proficiency in scripting languages such as Python, Bash, or PowerShell.
Strong understanding of version control systems such as Git.
Excellent problem-solving and analytical skills, with the ability to troubleshoot and resolve technical issues.
Strong communication and collaboration skills, with the ability to work effectively with cross-functional teams and stakeholders.
Certification in DevOps or cloud platforms (e.g., AWS Certified DevOps Engineer, Azure DevOps Engineer Expert) is preferred.
Apply for this position
Application Status
Application Draft
In Progress
Submit Application
Pending
Review Process
Expected within 5-7 days
Similar Jobs




