Course Cap
🔴 LIVE: 0 hiring rooms active now
0 HRs ready to interview
Average hiring time improving
0 new rooms opened in last 10 mins
Join Live Rooms - Skip the wait, get hired faster
🔴 LIVE: 0 hiring rooms active now
0 HRs ready to interview
Average hiring time improving
0 new rooms opened in last 10 mins
Join Live Rooms - Skip the wait, get hired faster

LLM Engineer Job in Noida at Opkey

Interview with HRs instantly—live now.

Skip applications. Get hired faster in Live Rooms.

Join instant video interviews

company-logo
LLM Engineer

Opkey

  Full Time Job

  Not Disclosed

  2-4 years

  Posted  16 days ago

Location
  • Noida
Skills Required
  • Large Language Model
  • Amazon Web Services
  • PyTorch
  • Service Level Agreements
About this Job

Opkey is hiring for the role of LLM Engineer!

Responsibilities of the Candidate:

  • Build, fine-tune, and deploy Large Language Models (LLMs) and Small Language Models (SLMs) for product use cases.
  • Develop Retrieval-Augmented Generation (RAG) architectures, vector-search workflows, and intelligent document-processing pipelines.
  • Optimize model latency, accuracy, token efficiency, and cost for production environments.
  • Implement scalable MLOps workflows, including CI/CD pipelines, model versioning, monitoring, and automated evaluation.
  • Experiment with prompting, fine-tuning, quantization, distillation, and domain adaptation for ERP/testing-specific use cases.
  • Work with vector databases (Pinecone, FAISS, Milvus, etc.) to build robust retrieval systems.
  • Research new GenAI techniques and contribute to innovation across Opkey’s AI roadmap. 

Requirements:

  • Bachelors with 2-4 years, or Master's with 1-2 years, of experience in AI/ML engineering with hands-on exposure to NLP or GenAI.
  • Strong understanding of LLMs, embeddings, transformers, and generative AI concepts.
  • Experience with LLM fine-tuning (LoRAQLoRA, PEFT) or training SLMs for domain-specific tasks.
  • Practical experience with frameworks such as PyTorchJAX, Hugging Face Transformers.
  • Hands-on experience implementing RAG pipelines and working with vector databases (e.g., Pinecone, Weaviate, FAISS).
  • Strong Python programming skills with experience in building APIs or backend services.
  • Knowledge of deploying AI models to cloud platforms (AWS, E2ERunPod).
  • Basic understanding of the MLOps lifecycle—model packaging, containerization (Docker), EvaluationMonitoring, etc. 
Eligible Degrees
MBA / All Courses
Bachelor of Technology/Engineering / All Courses
Master of Technology / All Courses
Bachelor of Arts / All Courses
Bachelor of Science / All Courses

+96 More

Who can apply
Work Experience: 2-4 years
Eligible Graduation Years: 2024, 2023, 2022, 2021
Documents Required

1. Resume

2. ID Proof (e.g. Aadhar Card, PAN Card, etc.)

About Opkey
Not ready to apply yet?

Explore Live Hiring Rooms and interview with HRs instantly - no waiting, no lengthy applications!

🔴 Live Now

23

Active Rooms

47

HRs Online

👤

Priya S.

Got hired in 2 hours!

"Joined a Live Room at 2pm, interviewed instantly, and got the offer by 4pm. This is revolutionary!"

Stand out and get shortlisted up to 10X more

⚡ How Live Rooms Work
1

Browse live hiring rooms

2

Click to join - HR is waiting

3

Interview instantly, get hired faster

🔥 3 new rooms opened in the last 10 minutes!

Recommended Jobs For You
Not ready to apply yet?

Explore Live Hiring Rooms and interview with HRs instantly - no waiting, no lengthy applications!