Home Job Details
H
Information Technology 🏢 Full Time ⭐️ Verified

Senior AI Infrastructure Engineer

Horizon 2026 Inc.
San Francisco
Estimated Salary
USD 160.000 – USD 230.000
New
Live Update
29 Juni 2026
Deadline
29 Jun 2027

Job Description

We are Horizon 2026 Inc., a forward-thinking technology firm dedicated to architecting the digital infrastructure of the future. We are seeking a visionary Senior AI Infrastructure Engineer to join our elite team in San Francisco. In this role, you will be at the forefront of deploying next-generation machine learning models and ensuring the scalability, security, and performance of our proprietary neural networks.

As a leader in the space, we don't just use existing tools; we define them. You will work in a fast-paced, high-performance environment where your expertise in cloud architecture and AI optimization will directly shape the trajectory of our products. If you are passionate about building resilient systems for the AI era, we want to hear from you.

Responsibilities

  • Design & Deploy: Architect and maintain scalable cloud infrastructure (AWS/GCP) to support high-volume AI workloads and real-time data processing.
  • Model Optimization: Implement and optimize inference pipelines for large language models (LLMs) to ensure low-latency responses and high throughput.
  • System Reliability: Monitor system health using advanced observability tools (Datadog, Prometheus) and implement automated scaling strategies to maintain 99.99% uptime.
  • Security & Compliance: Enforce rigorous security protocols to protect proprietary data and ensure compliance with industry standards (SOC2, GDPR).
  • Collaboration: Partner with data scientists and software engineers to translate research models into production-ready services.
  • Infrastructure as Code: Manage and evolve Terraform configurations to automate environment provisioning and reduce manual overhead.

Qualifications

  • Experience: 5+ years of experience in backend engineering or DevOps with a focus on AI/ML infrastructure.
  • Tech Stack: Proficiency in Python, Docker, Kubernetes, and at least one major cloud provider (AWS, GCP, or Azure).
  • AI/ML Knowledge: Deep understanding of MLOps principles, containerization, and model serving frameworks (TensorFlow Serving, TorchServe).
  • Problem Solving: Strong analytical skills with the ability to troubleshoot complex distributed system issues under pressure.
  • Education: Bachelor’s degree in Computer Science, Engineering, or a related technical field.
  • Communication: Excellent verbal and written communication skills with the ability to articulate technical concepts to non-technical stakeholders.

Required Skills

Kubernetes AWS Python Docker Machine Learning MLOps Terraform CI/CD Linux RESTful APIs Data Pipelines

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All