We are excited to share the job description for the Gen AI Architect position at Global Applications. Based on your background and experience, we believe this could be a great opportunity for you.
Job Title: Gen AI Architect
Location: Remote (U.S. Based)
Employment Type: W2 Only
Job Overview:
We are seeking a highly skilled Gen AI Architect with deep expertise in Large Language Models (LLMs), Retrieval-Augmented Generation (RAG) frameworks, and GPU-accelerated AI solutions. In this role, you will be a key contributor to the design and implementation of generative AI solutions that drive innovation and automation at scale for a leading client in the aerospace domain.
Key Responsibilities:
- Design and architect scalable GenAI solutions leveraging LLMs and RAG techniques.
- Develop and optimize AI/ML pipelines for training and inference on GPU-based infrastructure.
- Collaborate with data scientists, machine learning engineers, and product teams to integrate GenAI capabilities into enterprise-grade applications.
- Evaluate and recommend LLMs (open-source or proprietary) based on use-case requirements.
- Ensure AI solutions meet performance, scalability, and security standards.
- Stay abreast of advancements in generative AI and propose innovative use cases and solutions.
Required Skills & Qualifications:
- Strong experience with LLMs (e.g., OpenAI GPT models, LLaMA, Falcon, Mistral, Claude, or similar).
- Expertise in RAG architecture, including vector databases (e.g., FAISS, Pinecone, Weaviate, or similar).
- Deep understanding of GPU architectures (e.g., NVIDIA CUDA, TensorRT) for AI workloads.
- Proficiency in Python and ML frameworks such as TensorFlow, PyTorch, or Hugging Face Transformers.
- Experience deploying models using containerization and orchestration tools (Docker, Kubernetes).
- Strong knowledge of AI/ML system design, MLOps, and model evaluation techniques.
- Excellent communication and documentation skills.