Job Description
We are looking for a Generative AI / LLM Engineer to design, develop, and deploy advanced AI solutions powered by large language models. In this role, you will work on cutting-edge technologies to build intelligent systems for automation, content generation, and decision support. You will play a key role in integrating AI capabilities into products and improving user experiences through scalable and efficient AI architectures.
Key Responsibilities
Develop and deploy applications using large language models (LLMs) such as GPT, LLaMA, or similar architectures.
Design and optimize prompts and workflows to enhance AI output quality and reliability.
Build and implement Retrieval-Augmented Generation (RAG) systems using vector databases and embeddings.
Fine-tune and customize pre-trained models for domain-specific use cases.
Integrate AI models into applications via APIs and backend services.
Work with frameworks like LangChain, LlamaIndex, or Hugging Face to build AI pipelines.
Deploy and manage AI solutions on cloud platforms such as AWS, Azure, or GCP.
Collaborate with cross-functional teams including product, design, and engineering.
Monitor model performance and continuously improve system accuracy and efficiency.
Stay updated with the latest advancements in generative AI and LLM technologies.
Required Skills & Qualifications
Strong experience in Python and AI/ML development.
Hands-on experience with large language models and prompt engineering.
Knowledge of NLP concepts and transformer-based architectures.
Experience with vector databases (Pinecone, FAISS, Weaviate) and semantic search.
Familiarity with frameworks like LangChain, Hugging Face, or similar tools.
Experience with cloud platforms (AWS, Azure, or GCP).
Understanding of APIs, microservices, and scalable system design.
Strong problem-solving and analytical skills.
Ability to work independently and in a team environment.