via Truemote
Salary: Not specified
Design and implement enterprise-scale AI solutions using LLMs and RAG, integrating AI models into enterprise systems and managing their lifecycle in cloud environments.
8+ years software/data architecture experience including 3+ years in GenAI/LLMs, strong Python and cloud platform skills, and hands-on experience with AI model operationalization tools.
Seeking a Generative AI Architect to design and implement enterprise-scale AI solutions using LLMs, RAG, and the latest AI platforms like Gemini, Azure ML, OpenAI, and Anthropic. The role involves building scalable architectures, integrating enterprise data, and operationalizing AI models across cloud environments.

Key Responsibilities
• Architect and deploy LLM- and RAG-based AI solutions for business use cases.
• Integrate models such as Gemini, GPT, Claude, and Llama 3 within enterprise systems.
• Build pipelines using LangChain, LlamaIndex, and vector databases (Pinecone, Weaviate, FAISS).
• Manage end-to-end lifecycle on Azure ML, Vertex AI, or AWS Bedrock.
• Ensure data security, governance, and model performance monitoring.
• Collaborate with engineering and MLOps teams to productionize AI systems.

Qualifications
• Bachelor’s or Master’s in Computer Science, AI/ML, or related field.
• 8+ years in software/data architecture, with 3+ years in GenAI/LLMs.
• Strong in Python, APIs, Docker/Kubernetes, and CI/CD.
• Hands-on with RAG design, embeddings, fine-tuning, and prompt optimization.
• Experience with Gemini, Azure ML, OpenAI, LangChain, and vector stores.

Preferred
• Exposure to multimodal AI (text, vision, audio).
• Experience with LLMOps tools (LangFuse, PromptLayer, Weights & Biases).
• Strong communication and documentation skills.
This job posting was last updated on 11/27/2025