Job Description
We are Horizon Tech, a pioneering force in next-generation generative intelligence. We are seeking a visionary Senior AI/LLM Engineer to join our elite team in Austin, Texas. In this role, you will be instrumental in architecting the future of our AI infrastructure, pushing the boundaries of what is possible with Large Language Models (LLMs) and generative AI.
If you are passionate about deploying state-of-the-art AI solutions and want to work in an environment that prioritizes innovation and technical excellence, we want to meet you.
Responsibilities
- Model Architecture & Optimization: Design, train, and fine-tune large-scale language models (e.g., Llama, Mistral, GPT-based) to achieve superior performance and accuracy.
- RAG Implementation: Build robust Retrieval-Augmented Generation (RAG) pipelines to enhance model outputs with real-time, context-aware knowledge retrieval.
- Inference Optimization: Engineer efficient inference systems to reduce latency and operational costs in high-volume production environments.
- Research & Development: Stay at the forefront of AI research, experiment with novel architectures, and integrate cutting-edge techniques into our product stack.
- Collaboration: Partner with product managers, data scientists, and software engineers to translate complex AI capabilities into user-centric features.
Qualifications
- Education: Master's or PhD in Computer Science, Artificial Intelligence, or a related quantitative field.
- Experience: 5+ years of professional experience in machine learning, deep learning, or natural language processing.
- Technical Stack: Strong proficiency in Python and at least one deep learning framework (PyTorch, TensorFlow, or JAX).
- LLM Expertise: Deep understanding of transformer architectures, attention mechanisms, and prompt engineering.
- Infrastructure: Experience with cloud platforms (AWS/GCP/Azure) and vector databases (Pinecone, Milvus, Weaviate).