Job Description
Are you ready to define the next era of artificial intelligence? We are seeking a visionary Senior Generative AI Engineer to join our elite team in San Francisco. As we look toward 2026, we are building the infrastructure for the AI-native world. In this role, you will architect and deploy state-of-the-art Large Language Models (LLMs) that power enterprise-grade applications, ensuring they are not only powerful but also ethical and scalable.
We are looking for a builder who thrives in a fast-paced, high-tech environment and is passionate about the future of Machine Learning.
Responsibilities
- Architect LLM Solutions: Design and implement scalable Generative AI architectures using modern frameworks and transformer models.
- Optimize Performance: Reduce inference latency and optimize model cost-efficiency through techniques like quantization and distillation.
- Build RAG Pipelines: Develop robust Retrieval-Augmented Generation systems to enhance the accuracy and context-awareness of AI outputs.
- MLOps Implementation: Establish CI/CD pipelines for machine learning models, ensuring seamless deployment and monitoring in production environments.
- Collaborate & Innovate: Work closely with data scientists, product managers, and security teams to translate complex AI capabilities into user-centric features.
Qualifications
- Education: Masterβs or PhD in Computer Science, Artificial Intelligence, or a related technical field.
- Experience: 5+ years of professional experience in Deep Learning, Natural Language Processing (NLP), or Generative AI.
- Technical Skills: Expert proficiency in Python, PyTorch, TensorFlow, or Hugging Face Transformers.
- Tools: Strong experience with cloud platforms (AWS/GCP/Azure), vector databases (Pinecone/Milvus), and containerization (Docker/Kubernetes).
- Soft Skills: Excellent problem-solving abilities and the capacity to communicate complex technical concepts to non-technical stakeholders.