Job Description
We are seeking a visionary Senior Generative AI Architect to define the technological roadmap for 2026 and beyond. As a key player in our R&D division, you will lead the development of next-generation Large Language Models (LLMs) and autonomous agents that will redefine human-machine interaction. You will work at the intersection of deep learning research and scalable engineering, ensuring our AI solutions are not only state-of-the-art but ethically aligned and commercially viable.
Why join us?
- Work on cutting-edge AI infrastructure that powers the future.
- Competitive equity package and comprehensive benefits.
- Flexible remote-first culture with a hub in the heart of San Francisco.
Responsibilities
- Lead Research & Development: Architect and optimize state-of-the-art transformer architectures and Generative AI models for high-volume production environments.
- System Design: Design scalable MLOps pipelines to handle data ingestion, training, fine-tuning, and deployment of 2026-ready AI agents.
- Prompt Engineering & Optimization: Develop advanced prompt strategies and reinforcement learning from human feedback (RLHF) frameworks to improve model accuracy and safety.
- Cross-Functional Collaboration: Partner with product managers and data scientists to translate complex business requirements into technical AI solutions.
- Performance Tuning: Rigorously test and optimize model inference latency and cost-efficiency to ensure real-time user experiences.
Qualifications
- Education: Masterβs or PhD in Computer Science, Mathematics, or a related field with a focus on Deep Learning.
- Experience: 5+ years of professional experience in Machine Learning, Natural Language Processing (NLP), or Generative AI.
- Technical Skills: Proficiency in Python, PyTorch, TensorFlow, and experience with Hugging Face Transformers.
- Architecture: Strong understanding of distributed systems, cloud infrastructure (AWS/GCP), and containerization (Docker/Kubernetes).
- Innovation: Demonstrated history of publishing research or shipping high-impact AI products.