Job Description
Are you ready to architect the backbone of the next generation of artificial intelligence? Nexus Core Systems is looking for a Senior AI Infrastructure Engineer to lead our infrastructure initiatives for the 2026 era. We are building the future of scalable, low-latency neural networks, and we need a technical visionary to ensure our systems are resilient, efficient, and ready for the demands of tomorrow.
In this role, you will bridge the gap between cutting-edge machine learning research and robust, production-grade software engineering. You will optimize our GPU clusters, streamline data pipelines, and design microservices that power our predictive AI models.
Responsibilities
- Design and implement scalable infrastructure for training and deploying large-scale AI models.
- Optimize database performance and data pipelines for real-time inference.
- Collaborate with data scientists to translate research into reliable production systems.
- Implement security best practices and ensure compliance with data governance standards.
- Drive the migration to cloud-native architectures to reduce latency and improve scalability.
- Monitor system health and troubleshoot complex distributed system issues.
Qualifications
- 5+ years of experience in software engineering or infrastructure architecture, with a focus on AI/ML.
- Strong proficiency in Python, Go, or Rust.
- Deep understanding of containerization (Docker/Kubernetes) and orchestration.
- Experience with cloud platforms (AWS, GCP, or Azure) and serverless computing.
- Familiarity with distributed computing frameworks (Apache Spark, Ray) is a plus.
- Bachelor’s degree in Computer Science, Engineering, or a related technical field.
- Experience with high-performance computing (HPC) environments and GPU clusters.