Senior Gen AI Engineer – AI/LLM Backend Focus (Contract to Hire, Remote – U.S. Eastern Time Zone)

Jobs
Informulate

Informulate

-

🌎 Remote

Posted on: 1 June, 2025

Senior Gen AI Engineer – AI/LLM Backend Focus (Contract to Hire, Remote – U.S. Eastern Time Zone)

Role Overview

Informulate is seeking a Gen AI Engineer with deep expertise in large language model (LLM) architectures to join our tight‑knit team. In this role, you will design and implement cutting-edge AI/LLM backend systems that power intelligent applications—everything from chatbots to semi‑autonomous agents. You’ll architect robust backends: orchestrating agentic workflows, retrieval‑augmented generation (RAG) pipelines, and high‑performance knowledge bases.

This is a fully remote position (U.S. only) with flexible hours on Eastern Time. We welcome both part‑time and full‑time contractors for this role. You’ll collaborate virtually to build AI‑driven solutions that innovate and scale.

Key Requirements

  • LLM Framework Expertise: Deep experience with frameworks such as LangChain, LangGraph, Crew AI, or equivalent for building conversational and agent‑driven applications.
  • Agentic Workflows: Strong understanding of agent types and paradigms—ReAct (reason‑act), planning agents, reflective agents—and hands‑on experience implementing multi‑step AI workflows.
  • Retrieval‑Augmented Generation (RAG): Advanced knowledge of RAG techniques, including graph RAG, hybrid RAG, and agentic RAG implementations to augment LLMs with external knowledge.
  • Knowledge Bases & Vector Databases: Experience building and maintaining scalable vector‑based knowledge bases (e.g., Pinecone, Weaviate). Proficiency optimizing vector search with approximate nearest neighbor (ANN) algorithms (HNSW, IVF, PQ), reranking techniques, max‑inner‑product search, cosine similarity, and libraries like Faiss, Annoy, or similar.
  • Python & Pydantic: Expert in Python, with strong use of Pydantic for data validation and modeling in AI pipelines.
  • Model Fine‑tuning & Evaluation: Hands‑on experience fine‑tuning LLMs and using evaluation frameworks such as LangSmith to measure and improve model performance.
  • MCP Protocol: Understanding of the Model Context Protocol (MCP) and how to integrate MCP servers and clients for dynamic context management.
  • AWS Bedrock: Familiarity with AWS Bedrock services for deploying and scaling AI models, or demonstrated ability to learn quickly.
  • Voice‑Enabled AI: Experience integrating voice‑enabled AI technologies, such as OpenAI’s real‑time Voice API.
  • Model Providers & Private LLMs: Proven experience working with major API model providers (OpenAI, Anthropic Claude, Google Gemini) as well as open‑source LLMs (e.g., Meta’s Llama), including hosting private LLM instances and integrating diverse provider APIs.
  • Front-end experience with React; backend experience with Node.js and TypeScript.

Preferred Qualifications

  • Cloud Deployment: Proven record deploying LLM applications on cloud platforms (AWS preferred), including CI/CD, containerization (Docker), and security best practices for AI services.
  • Prompt Engineering: Mastery of prompt design and engineering techniques, with an emphasis on systematic evaluation and optimization.
  • AI‑Native Architecture Design: Ability to architect modular, scalable AI‑first systems, leveraging serverless functions or pipeline patterns optimized for AI/ML workloads.

Application Domains

Work on diverse AI‑driven projects, such as:

  • LLM‑Powered Chatbots: Scalable conversational agents for customer support, knowledge retrieval, or virtual assistants.
  • Workflow Automation: Intelligent automation tools that orchestrate business processes and decision logic with minimal human supervision.
  • Semi‑Autonomous Agents: Systems of collaborative agents employing planning and reflection to achieve complex tasks.

Company & Culture

Informulate (informulate.com, informulate.ai) is a U.S.‑based software consulting firm specializing in innovation, custom development, and AI-driven digital products. Founded in 2006, we combine Lean Startup, Agile, and Design Thinking to deliver measurable business impact. Our dedicated AI division, Informulate.AI, focuses on generative AI research, and strategic AI solutions.

We are a remote‑first company with a tight‑knit culture built on our core values:

  • Empathy: Deeply understand and advocate for user and client needs.
  • Quality: Pursue excellence and continuous improvement in every line of code.
  • Integrity: Uphold trust and accountability in all interactions.
  • Impact: Deliver solutions that create lasting business value.
  • Simplicity: Strive for clarity and elegance in design and implementation.
  • Responsiveness: Act swiftly and adapt to evolving challenges.

How to Apply

If you’re passionate about pushing the boundaries of AI and meet the requirements above, we want to hear from you! Send your resume and any relevant project links (GitHub, portfolio, LinkedIn) to ashley.capielo@informulate.net. Include a brief introduction explaining why you’re a great fit for our Gen AI Engineer role.

We look forward to innovating together!

Job Types: Full-time, Contract

Pay: $60.00 - $70.00 per hour

Expected hours: 40 per week

Benefits:

  • Flexible schedule

Compensation Package:

  • Hourly pay

Schedule:

  • 8 hour shift
  • Choose your own hours

Application Question(s):

  • Please provide 3 references.

Work Location: Remote

Tags:
ai
ml
Share the job:

Related Jobs

AI Engineer​/remote
AI Engineer​/remote

Full Time - 🌎 Remote