Job Description
Join the Future of Intelligence. Apex Horizon Systems is pioneering the next generation of autonomous agents and generative workflows. We are looking for a visionary Senior AI Architect to lead the design and deployment of Large Language Models (LLMs) that will redefine enterprise productivity in 2026 and beyond.
You will work at the intersection of research and engineering, building scalable, secure, and high-performance AI systems. If you are passionate about pushing the boundaries of what's possible with Generative AI, this is your opportunity to shape the roadmap.
Why Join Us?
- Competitive compensation package (Salary + Equity).
- Top-tier healthcare and 401(k) matching.
- Remote-first culture with flexible PTO.
- Access to cutting-edge hardware and cloud credits.
The Opportunity:
As a Senior AI Architect, you will be responsible for the full lifecycle of our AI initiatives—from proof-of-concept to production-grade deployment. You will collaborate with cross-functional teams of data scientists, engineers, and product managers to deliver transformative AI solutions.
Responsibilities
- Architect and implement robust LLM pipelines using PyTorch and TensorFlow, ensuring high throughput and low latency.
- Design and optimize Retrieval-Augmented Generation (RAG) architectures to enhance model accuracy and reduce hallucinations.
- Lead the fine-tuning process of open-source models (e.g., Llama 3, Mistral) on proprietary enterprise data.
- Establish MLOps best practices, including model versioning, CI/CD for ML, and automated monitoring for drift and bias.
- Collaborate with product teams to translate complex AI capabilities into user-friendly features.
- Conduct code reviews, technical mentoring, and architecture planning for junior engineers.
- Ensure compliance with data privacy regulations (GDPR, CCPA) in AI model training and inference.
Qualifications
- Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, or a related technical field.
- 7+ years of experience in software engineering with a focus on Machine Learning or Deep Learning.
- Expert-level proficiency in Python, including strong knowledge of async programming and high-performance computing.
- Hands-on experience with Hugging Face Transformers, LangChain, or similar LLM frameworks.
- Strong understanding of distributed systems, cloud infrastructure (AWS/Azure/GCP), and containerization (Docker/Kubernetes).
- Proven track record of deploying production-grade AI models serving thousands of requests per second.
- Experience with vector databases (Pinecone, Weaviate, Milvus) and embedding strategies.
- Excellent communication skills and the ability to articulate complex technical concepts to non-technical stakeholders.