Job Description
Shape the Future of Intelligence
Join Apex Future Tech, a pioneer in next-generation artificial intelligence, as our Senior Generative AI Architect. We are not just building tools for today; we are architecting the foundational models that will power the world of 2026 and beyond. You will lead a world-class team in developing state-of-the-art Large Language Models (LLMs) and multimodal systems that redefine human-machine interaction.
At Apex, we value radical innovation, technical excellence, and the courage to explore the uncharted territories of machine learning. If you are passionate about pushing the boundaries of what AI can achieve, this is your opportunity to lead the charge.
What You Will Do
Responsibilities
- Model Architecture: Design and implement scalable, production-grade Generative AI architectures (LLMs, Diffusion Models) optimized for speed and efficiency.
- Research & Innovation: Stay at the forefront of AI research, adapting cutting-edge techniques (e.g., Reinforcement Learning from Human Feedback, RAG, Fine-tuning) to solve complex business problems.
- Pipeline Development: Build robust data pipelines and MLOps infrastructure to support the full lifecycle of model development, from experimentation to deployment.
- Team Leadership: Mentor junior engineers and data scientists, conducting code reviews, and fostering a culture of continuous learning and technical excellence.
- Performance Tuning: Optimize inference latency and resource utilization to ensure seamless user experiences across global markets.
- Strategic Planning: Collaborate with product leaders to define the AI roadmap and translate technical feasibility into business value.
Qualifications
- Experience: 5+ years of experience in software engineering or machine learning, with a minimum of 3 years specifically focused on Generative AI or Deep Learning.
- Technical Stack: Proficiency in Python, PyTorch, or TensorFlow. Experience with distributed training frameworks (Ray, Horovod) and cloud platforms (AWS, GCP, Azure).
- Model Expertise: Deep understanding of transformer architectures, attention mechanisms, and prompt engineering strategies.
- Education: Bachelor’s or Master’s degree in Computer Science, Mathematics, or a related field.
- Problem Solving: Strong analytical skills with a proven track record of delivering high-impact projects under tight deadlines.
- Communication: Excellent verbal and written communication skills, capable of explaining complex technical concepts to diverse stakeholders.