Expoint – all jobs in one place
Finding the best job has never been easier

Devops jobs in United States, California, Palo Alto

Unlock your potential in the high tech industry with Expoint. Search for job opportunities as a Devops in United States, California, Palo Alto and join the network of leading companies. Start your journey today and find your dream job as a Devops with Expoint.
Company
Job type
Job categories
Job title (1)
United States
California
Palo Alto
3 jobs found
07.05.2025
S

Salesforce Lead DevOps Engineer - AI Research Incubation United States, California, Palo Alto

Limitless High-tech career opportunities - Expoint
Bachelor’s degree in Computer Science, Software Engineering, or a related field. 3+ years of experience in DevOps, cloud infrastructure, or site reliability engineering. Strong experience with AWS and GCP, including...
Description:

Job Category

Software Engineering

Job Details

About the Role

Key Responsibilities
Design, implement, and manage cloud infrastructure (AWS, GCP) including networking, security, and compute resources.
Develop and maintain CI/CD pipelines to automate deployment and testing of AI models and applications.
Build, manage, and optimize Kubernetes clusters for deploying AI services and research applications.
Implement infrastructure as code (IaC) using Terraform and Helm to ensure repeatable and scalable deployments.
Automate system operations and monitoring using Python and shell scripting.
Ensure security standard methodologies across cloud environments, including firewall and access control management.
Solve infrastructure issues and optimize system performance.
Collaborate with AI researchers and software engineers to streamline model deployment and integration.
Task about managing databases (SQL and No-SQL), including database provisioning, performance tuning, and backup strategies.
Ensure database security, replication, and high availability across cloud environments.

Required Qualifications

  • Bachelor’s degree in Computer Science, Software Engineering, or a related field.
  • 3+ years of experience in DevOps, cloud infrastructure, or site reliability engineering.
  • Strong experience with AWS and GCP, including DNS, VM management, networking, Kubernetes, and firewall security.
  • Proficiency in CI/CD pipeline development and automation (GitHub Actions, Jenkins, GitLab CI/CD, etc.).
  • Expertise in Docker, Kubernetes, and Helm for container orchestration and deployment.
  • Hands-on experience with Terraform for infrastructure provisioning and management.
  • Strong scripting skills in Python and shell scripting for automation.
  • Solid understanding of networking, security standard processes, and cloud monitoring tools.
  • Excellent troubleshooting and problem-solving skills.

Preferred Qualifications

  • Experience with AI/ML model deployment and pipeline automation.
  • Knowledge of logging and monitoring tools (Prometheus, Grafana, ELK stack, etc.).
  • Familiarity with serverless computing and cloud-native application design.
  • Contributions to open-source DevOps tools or frameworks.
  • Experience with Salesforce Falcon is a plus.

If you require assistance due to a disability applying for open positions please submit a request via this.

Posting Statement

does not accept unsolicited headhunter and agency resumes.

Show more
13.04.2025
JPM

JPMorgan Site Reliability Engineer III- DevOps United States, California, Palo Alto

Limitless High-tech career opportunities - Expoint
Design, implement, and manage scalable, reliable, and secure cloud infrastructure on AWS, including deploying and scaling containerized applications using Kubernetes (EKS) and ECS. Develop and maintain infrastructure as code using...
Description:

Job responsibilities

  • Design, implement, and manage scalable, reliable, and secure cloud infrastructure on AWS, including deploying and scaling containerized applications using Kubernetes (EKS) and ECS.
  • Develop and maintain infrastructure as code using Terraform to automate provisioning and configuration management, ensuring efficient and consistent deployments.
  • Monitor system performance, optimize EKS workloads, and implement solutions to improve reliability and performance, including autoscaling and disaster recovery strategies.
  • Implement logging and tracing using tools like ELK Stack, Splunk, Dynatrace, and AWS CloudWatch to ensure comprehensive monitoring and alerting.
  • Integrate security tools such as SonarQube, Snyk, Trivy, and Aqua Security into CI/CD pipelines using Jenkins or AWS CodePipeline, and define automated rollback policies in Spinnaker.
  • Collaborate with development teams to ensure smooth deployment and operation of applications, implementing and managing CI/CD pipelines to streamline the software development lifecycle.
  • Troubleshoot and resolve infrastructure-related issues promptly, while continuously evaluating and implementing modern technologies and tools to improve operational efficiency.

Required qualifications, capabilities, and skills

  • Formal training or certification on Site Reliability concepts and 3+ years applied experience
  • Strong expertise in AWS services, including EC2, S3, RDS, VPC, IAM, and networking, with hands-on experience in Kubernetes (EKS) and ECS for container orchestration.
  • Proficiency in using Terraform for infrastructure as code and a solid understanding of CI/CD concepts and tools such as Jenkins, GitLab CI, CircleCI, AWS CodePipeline, and Spinnaker.
  • Experience with monitoring and logging tools like Prometheus, Grafana, ELK Stack, CloudWatch, and observability practices including white and black box monitoring and telemetry collection.
  • Strong scripting skills in languages such as Python, Bash, or Go, and proficiency in at least one programming language such as Python, Java/Spring Boot, or .Net.
  • Excellent problem-solving skills, attention to detail, and the ability to troubleshoot common networking technologies and issues.
  • Strong communication and collaboration skills, with the ability to contribute to large and collaborative teams by presenting information logically and compellingly.
  • Proficient in site reliability culture and principles, with familiarity in implementing site reliability within an application or platform.
  • Proficient knowledge of software applications and technical processes within a given technical discipline, such as Cloud or artificial intelligence.
  • Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform, and familiarity with container orchestration using ECS, Kubernetes, and Docker.
  • Ability to proactively recognize roadblocks, demonstrate interest in learning technology that facilitates innovation, and initiate and implement ideas to solve business problems.
Preferred qualifications, capabilities, and skills
  • Possession of AWS Certified Solutions Architect or DevOps Engineer certification, demonstrating advanced expertise in AWS services.
  • Strong problem-solving skills with the ability to troubleshoot complex CI/CD issues effectively and efficiently.
  • Proficiency in Java or scripting languages such as Bash, Node.JS, Shell, and Python, showcasing versatility in programming.
  • Experience with microservices architecture and serverless computing, enabling scalable and efficient application development.
  • Familiarity with security best practices in cloud environments, ensuring robust and secure infrastructure management.
  • Demonstrated leadership skills and experience in mentoring engineers, fostering a collaborative and growth-oriented team environment.
Show more

These jobs might be a good fit

Limitless High-tech career opportunities - Expoint
Bachelor’s degree in Computer Science, Software Engineering, or a related field. 3+ years of experience in DevOps, cloud infrastructure, or site reliability engineering. Strong experience with AWS and GCP, including...
Description:

Job Category

Software Engineering

Job Details

About the Role

Key Responsibilities
Design, implement, and manage cloud infrastructure (AWS, GCP) including networking, security, and compute resources.
Develop and maintain CI/CD pipelines to automate deployment and testing of AI models and applications.
Build, manage, and optimize Kubernetes clusters for deploying AI services and research applications.
Implement infrastructure as code (IaC) using Terraform and Helm to ensure repeatable and scalable deployments.
Automate system operations and monitoring using Python and shell scripting.
Ensure security standard methodologies across cloud environments, including firewall and access control management.
Solve infrastructure issues and optimize system performance.
Collaborate with AI researchers and software engineers to streamline model deployment and integration.
Task about managing databases (SQL and No-SQL), including database provisioning, performance tuning, and backup strategies.
Ensure database security, replication, and high availability across cloud environments.

Required Qualifications

  • Bachelor’s degree in Computer Science, Software Engineering, or a related field.
  • 3+ years of experience in DevOps, cloud infrastructure, or site reliability engineering.
  • Strong experience with AWS and GCP, including DNS, VM management, networking, Kubernetes, and firewall security.
  • Proficiency in CI/CD pipeline development and automation (GitHub Actions, Jenkins, GitLab CI/CD, etc.).
  • Expertise in Docker, Kubernetes, and Helm for container orchestration and deployment.
  • Hands-on experience with Terraform for infrastructure provisioning and management.
  • Strong scripting skills in Python and shell scripting for automation.
  • Solid understanding of networking, security standard processes, and cloud monitoring tools.
  • Excellent troubleshooting and problem-solving skills.

Preferred Qualifications

  • Experience with AI/ML model deployment and pipeline automation.
  • Knowledge of logging and monitoring tools (Prometheus, Grafana, ELK stack, etc.).
  • Familiarity with serverless computing and cloud-native application design.
  • Contributions to open-source DevOps tools or frameworks.
  • Experience with Salesforce Falcon is a plus.

If you require assistance due to a disability applying for open positions please submit a request via this.

Posting Statement

does not accept unsolicited headhunter and agency resumes.

Show more
Find your next career move in the high tech industry with Expoint. Our platform offers a wide range of Devops job opportunities in the United States, California, Palo Alto area, giving you access to the best companies in the field. Whether you're looking for a new challenge or a change of scenery, Expoint makes it easy to find your perfect job match. With our easy-to-use search engine, you can quickly find job opportunities in your desired location and connect with top companies. Sign up today and take the next step in your high tech career with Expoint.