Job Description
Are you ready to architect the next generation of intelligent systems? Nexus Future Tech is seeking a visionary Senior AI Engineer to join our elite R&D division. We are at the forefront of the AI revolution, developing cutting-edge Generative AI models that are reshaping industries.
In this role, you won't just maintain existing systems; you will pioneer new methodologies in Large Language Models (LLMs), Computer Vision, and Autonomous Agents. If you are passionate about pushing the boundaries of what is possible with Artificial Intelligence and want to work in a high-performance, innovative environment, we want to meet you.
Why join Nexus Future Tech?
- Work with state-of-the-art infrastructure (NVIDIA H100 clusters).
- Competitive equity package and remote-first flexibility.
- Opportunity to publish research and influence the global AI landscape.
Join us in building the future of intelligence.
Responsibilities
- Model Architecture: Design, train, and fine-tune large-scale deep learning models, with a focus on Generative AI and LLMs.
- System Optimization: Improve inference latency and scalability of deployed AI models using techniques like quantization, pruning, and distillation.
- Research & Development: Stay ahead of the curve by implementing techniques from the latest academic papers in production pipelines.
- MLOps: Build robust CI/CD pipelines for model deployment, monitoring, and A/B testing using Kubernetes and Docker.
- Cross-functional Leadership: Collaborate with product managers and data scientists to define technical roadmaps and solve complex business problems.
- Code Quality: Write clean, maintainable, and well-documented Python code; conduct code reviews for junior engineers.
Qualifications
- Education: Master's or PhD degree in Computer Science, Mathematics, or a related field (or equivalent practical experience).
- Technical Expertise: 5+ years of professional experience in Machine Learning, Deep Learning, or AI Engineering.
- Languages: Proficiency in Python (PyTorch or TensorFlow preferred) and C++ for performance optimization.
- Frameworks & Platforms: Strong hands-on experience with Hugging Face Transformers, LangChain, and major cloud platforms (AWS/GCP/Azure).
- Problem Solving: Demonstrated ability to debug complex distributed systems and optimize algorithms for production environments.
- Communication: Excellent verbal and written communication skills; ability to present technical concepts to non-technical stakeholders.