🔴 LIVE: 0 hiring rooms active now•

0 HRs ready to interview•

Average hiring time improving•

0 new rooms opened in last 10 mins•

Join Live Rooms - Skip the wait, get hired faster•

🔴 LIVE: 0 hiring rooms active now•

0 HRs ready to interview•

Average hiring time improving•

0 new rooms opened in last 10 mins•

Join Live Rooms - Skip the wait, get hired faster•

LLM Engineer Job in Noida at Opkey

Interview with HRs instantly—live now.

Skip applications. Get hired faster in Live Rooms.

Join instant video interviews

LLM Engineer

Opkey

Full Time Job

Not Disclosed

2-4 years

Posted 16 days ago

Location

Noida

Skills Required

Large Language Model
Amazon Web Services
PyTorch
Service Level Agreements

About this Job

Opkey is hiring for the role of LLM Engineer!

Responsibilities of the Candidate:

Build, fine-tune, and deploy Large Language Models (LLMs) and Small Language Models (SLMs) for product use cases.
Develop Retrieval-Augmented Generation (RAG) architectures, vector-search workflows, and intelligent document-processing pipelines.
Optimize model latency, accuracy, token efficiency, and cost for production environments.
Implement scalable MLOps workflows, including CI/CD pipelines, model versioning, monitoring, and automated evaluation.
Experiment with prompting, fine-tuning, quantization, distillation, and domain adaptation for ERP/testing-specific use cases.
Work with vector databases (Pinecone, FAISS, Milvus, etc.) to build robust retrieval systems.
Research new GenAI techniques and contribute to innovation across Opkey’s AI roadmap.

Requirements:

Bachelors with 2-4 years, or Master's with 1-2 years, of experience in AI/ML engineering with hands-on exposure to NLP or GenAI.
Strong understanding of LLMs, embeddings, transformers, and generative AI concepts.
Experience with LLM fine-tuning (LoRA, QLoRA, PEFT) or training SLMs for domain-specific tasks.
Practical experience with frameworks such as PyTorch, JAX, Hugging Face Transformers.
Hands-on experience implementing RAG pipelines and working with vector databases (e.g., Pinecone, Weaviate, FAISS).
Strong Python programming skills with experience in building APIs or backend services.
Knowledge of deploying AI models to cloud platforms (AWS, E2E, RunPod).
Basic understanding of the MLOps lifecycle—model packaging, containerization (Docker), Evaluation, Monitoring, etc.

Eligible Degrees

MBA / All Courses

Bachelor of Technology/Engineering / All Courses

Master of Technology / All Courses

Bachelor of Arts / All Courses

Bachelor of Science / All Courses

+96 More

Who can apply

Work Experience: 2-4 years

Eligible Graduation Years: 2024, 2023, 2022, 2021

Documents Required

1. Resume

2. ID Proof (e.g. Aadhar Card, PAN Card, etc.)

About Opkey

Not ready to apply yet?

Explore Live Hiring Rooms and interview with HRs instantly - no waiting, no lengthy applications!

🔴 Live Now

Active Rooms

HRs Online

👤

Priya S.

Got hired in 2 hours!

"Joined a Live Room at 2pm, interviewed instantly, and got the offer by 4pm. This is revolutionary!"

Stand out and get shortlisted up to 10X more

⚡ How Live Rooms Work

Browse live hiring rooms

Click to join - HR is waiting

Interview instantly, get hired faster

🔥 3 new rooms opened in the last 10 minutes!

Recommended Jobs For You

Not ready to apply yet?

Explore Live Hiring Rooms and interview with HRs instantly - no waiting, no lengthy applications!