Top 16 Vector database langchain transformer architecture Jobs

AI Engineer
Better Direct —Tempe, AZ
- $60,000 - $80,000 a year
- Full-time
- Background check
- Azure
- System design
Quick Apply
Sr AI/ML Engineer - Hybrid in MN or DC - Remote elsewhere
Optum —Eden Prairie, MN
- $91,700 - $163,700 a year
- Full-time
- Hybrid work
- Azure
- Pass drug screening
- System design
AI Architect
NTT DATA —Auburn Hills, MI
- $83.55 an hour
- Computer Science
- System design
- 3 years
Quick Apply
Full Stack Developer
Steampunk —McLean, VA
- $140,000 - $190,000 a year
- Remote
- System design
- Mid-level
- Java
4d
Service Delivery Center, AI & Data, Machine Learning Engineer (MLE) - Senior
EY —Manayunk, PA
- $65,500 - $152,100 a year
- System design
- Research
- Driver's License
- Paid holidays
- Health insurance
Senior AI Engineer – LLM, RAG
BrightAI —Palo Alto, CA
- Computer Science
- 5 years
- System design
AI/ML SOFTWARE ENGINEER
Ignite Fueling Innovation —Huntsville, AL
- Full-time
- Computer Science
- Research
- Mid-level
Senior Data Scientist
Garmin International, Inc. —Olathe, KS
- Full-time
- Azure
- Computer Science
- 5 years
Senior Security Engineer, AI/ML
Qualys —Foster City, CA
- $115,000 - $140,000 a year
- Full-time
- Azure
- 6 years
- SQL
- Health insurance
- Retirement plan
3d
AI Cybersecurity Engineer
Upbound Group —Plano, TX
- Azure
- System design
- Office
16h
Senior Backend Engineer, AI Team
Acorns —United States
- $184,500 - $217,100 a year
- Full-time
- Remote
- Computer Science
- 3 years
- Master's degree
- Paid time off
- Parental leave
Quick Apply
Backend ML Engineer
Sterling Computers Corporation —United States
- Full-time
- Remote
- Computer science
- Azure
- Computer Science
Quick Apply

More jobs to discover

Find similar jobs that you might be interested in.

Sr AI/ML Engineer - Hybrid in MN or DC - Remote elsewhere
Optum —Eden Prairie, MN
You'll collaborate with world-class talent across data science, engineering, product, and healthcare domains, backed by the reach and stability of Optum and…
$91,700 - $163,700 a year
Lead AI/ML Engineer: Hybrid in Minnesota or DC, Remote elsewhere
Optum —Eden Prairie, MN
In this role, you will own the architecture for the capabilities customers feel every day, including real time voice translation, voice based assessments, and…
$112,700 - $193,200 a year

I want to receive the latest job alert for vector database langchain transformer architecture

By signing in to your account, you agree to SimplyHired's Terms of Service and consent to our Cookie and Privacy Policy.

Explore jobs in more locations

vector database langchain transformer architecture jobs near

AI Engineer

Better Direct -
Tempe, AZ

Quick Apply

Job Details

Full-time
$60,000 - $80,000 a year

Qualifications

AI models
Index optimization
TensorFlow
Data transformation pipeline development
NumPy
Generative models
AI platforms (beyond public GPTs)
Computational framework
Pandas
Machine intelligence
GitHub
Neural networks
Metadata
Benchmarking
Machine learning libraries
Model evaluation
Machine learning frameworks

Full Job Description

About the Role

We are looking for a motivated AI Engineer who is passionate about large language models (LLMs), machine learning, and applied AI systems. This position is focused on building real-world AI systems, not just experimenting with models.

You will work on projects involving open-source LLMs, retrieval-augmented generation (RAG) pipelines, vector databases, and AI-powered document processing systems. The goal is to build scalable AI workflows that solve practical problems such as knowledge retrieval, document analysis, and AI-assisted automation.

This role is ideal for engineers who want hands-on experience deploying AI systems used in production-like environments.

Responsibilities

AI & Machine Learning Development

Build and experiment with open-source LLM and SLM pipelines.
Design and implement Retrieval Augmented Generation (RAG) systems.
Develop AI pipelines capable of processing documents, PDFs, and structured data.
Work with embedding models and vector search systems.
Implement prompt engineering, model evaluation, and response optimization.
Assist with fine-tuning or adapting open-source models when necessary.

Data Processing & AI Pipelines

Build ingestion pipelines for PDFs, documents, and datasets.
Implement document chunking, embedding generation, and indexing strategies.
Work with vector databases to support semantic search and retrieval.
Optimize pipelines for latency, scalability, and cost efficiency.

Research & Experimentation

Evaluate different open-source models and architectures.
Compare embedding models and retrieval methods.
Test improvements in RAG performance and hallucination reduction.
Explore emerging techniques in Vision Language Models (VLMs).

Collaboration

Work with engineers to integrate AI components into applications.
Document experiments and technical findings.
Participate in weekly discussions on AI architecture decisions and improvements.

Required Skills & Knowledge

Core AI Knowledge

Strong understanding of:
Machine Learning
Neural Networks
Deep Learning fundamentals
Familiarity with:
Large Language Models (LLMs)
Small Language Models (SLMs)

Programming

Strong proficiency in Python
Experience with AI/ML frameworks such as:
PyTorch
TensorFlow
Hugging Face Transformers

AI Application Frameworks

Experience with:

LangChain or similar orchestration frameworks
Prompt engineering and AI workflow building

Vector Databases

Conceptual and practical understanding of any vector databases such as:

Pinecone
ChromaDB
Milvus
Qdrant
FAISS
Weaviate

Understanding of:

embeddings
similarity search
indexing strategies
metadata filtering

RAG Systems

Ability to design or understand:

Retrieval pipelines
Document chunking strategies
Embedding pipelines
Hybrid search
Context window optimization
RAG evaluation methods

Data Processing

Experience with:

PDF extraction
Document parsing pipelines
Data preprocessing

Bonus Knowledge

Vision Language Models (VLMs)
Multimodal AI systems
Distributed AI inference
GPU inference optimization

Preferred Project Experience

Candidates should have completed at least 1–3 hands-on AI projects, such as:

Example Project 1 – RAG Knowledge Assistant

Built a chatbot that answers questions from internal documentation.
Implemented document ingestion, chunking, embedding generation, and vector search.
Used LangChain + vector database + open-source LLM.

Example Project 2 – Document AI System

Created a system that extracts structured information from PDFs.
Built pipelines for PDF parsing → embeddings → AI summarization.

Example Project 3 – AI Research Experiment

Compared multiple embedding models and evaluated search accuracy.
Benchmarked RAG response quality and hallucination rates.

Example Project 4 – LLM Application

Built a real-world tool using open-source models (e.g., summarizer, Q&A system, coding assistant).

Real-World Experience

Candidates may also have experience such as:

Contributing to open-source AI projects
Participating in AI hackathons
Research experience in machine learning or NLP
Building production-style AI APIs
Deploying models using Docker or cloud platforms
Working with LLM inference servers (vLLM, TGI, Ollama, etc.)

Tools & Technologies (Exposure Preferred)

AI / ML

PyTorch
Hugging Face
Transformers
Sentence Transformers

LLM Tools

LangChain
LlamaIndex
Open-source LLMs (LLaMA, Mistral, etc.)

Vector Databases

Pinecone
ChromaDB
Milvus
Qdrant
FAISS
Weaviate

Data Tools

Pandas
NumPy

Deployment

Docker
REST APIs / FastAPI
Basic cloud exposure (AWS/GCP/Azure)

What You Will Gain

Hands-on experience building real AI systems
Exposure to modern LLM architecture and AI infrastructure
Experience with RAG pipelines and vector databases
Mentorship from engineers working in applied AI
Opportunity to contribute to real production-style AI tools

Candidate Profile

We are looking for someone who:

Is curious and loves experimenting with AI systems
Enjoys solving practical engineering problems
Can quickly learn new frameworks and models
Is comfortable reading research papers and technical documentation
Has strong problem-solving and debugging skills

Application Requirements

Please include:

Resume
GitHub profile (required)
Links to AI/ML projects
Brief description of a RAG or LLM system you have built

Pay: $60,000.00 - $80,000.00 per year

Work Location: In person

Quick Apply

Refine Your Search

vector database langchain transformer architecture jobs

AI Engineer

Sr AI/ML Engineer - Hybrid in MN or DC - Remote elsewhere

AI Architect

Full Stack Developer

Service Delivery Center, AI & Data, Machine Learning Engineer (MLE) - Senior

Senior AI Engineer – LLM, RAG

AI/ML SOFTWARE ENGINEER

Senior Data Scientist

Senior Security Engineer, AI/ML

AI Cybersecurity Engineer

Senior Backend Engineer, AI Team

Backend ML Engineer

More jobs to discover

I want to receive the latest job alert for vector database langchain transformer architecture

Explore jobs in more locations

Job Seeker Tools

Employer Tools

Browse

Stay Connected