Senior LLM Engineer: Text & Reasoning LLM / NLU

Omilia

  • Italia
  • Tempo indeterminato
  • Full time
  • 13 ore fa
  • Candidati facilmente
Role PurposeTo lead the technical development and continuous improvement of Omilia’s proprietary LLM and NLU service portfolio, powering the reasoning and language understanding layer of the enterprise-grade Self-Learning Agentic CX Platform. This role ensures technical correctness, system quality, and delivery monitoring for mission-critical AI services that enable autonomous agents to reason, plan, execute tasks, and self-improve in real customer conversations.Accountabilities
  • Technical Ownership: Hold final technical authority over all LLM/NLU services, including entity/intent classification, specialized LLMs, and agentic orchestration.
  • System Quality: Ensure production stability, performance, and compliance (including PCI/PII) across the LLM/NLU domain.
  • Delivery Monitoring: Commit to delivery dates, drive features from design through deployment, and proactively flag risks.
  • Autonomy: Resolve technical ambiguity, structure loosely defined requirements, and make architectural decisions independently.
  • Scope & Complexity: Lead the most complex, ambiguous, or cross-cutting features, including model research, agentic reasoning, and inference server development.
  • Impact: Directly influence the quality and reliability of AI services serving millions of customer interactions in regulated industries.
  • Influence/Mentorship: Guide and mentor mid-level and junior engineers through code reviews, pairing, and knowledge transfer; drive alignment between Product, Architecture, and Engineering.
Key Responsibilities
  • Lead research and experimentation on new model architectures, training strategies, and evaluation methodologies for LLM/NLU.
  • Design, develop, fine-tune, and evaluate specialized LLMs for Concierge and Task Agents.
  • Develop and optimize ML pipelines for training, evaluation, and deployment (AWS SageMaker).
  • Architect and maintain inference servers, ensuring low latency and high reliability.
  • Implement and evolve closed-loop self-learning systems for continuous model improvement.
  • Drive benchmarking, experiment reproducibility, and documentation quality.
  • Ensure compliance with data privacy standards throughout the ML lifecycle.
  • Mentor and support the growth of team members; share expertise via tech talks and guides.
RequirementsKnowledge, Skills & Experience
  • 5+ years in applied LLM/ML/NLU/NLP, with ownership of production ML systems at scale.
  • Strong hands-on skills in Python, PyTorch, and HuggingFace Transformers.
  • Deep experience with LLMs: fine-tuning, distillation, prompt engineering, evaluation, and deployment (especially small/efficient models).
  • Solid foundation in NLU: intent classification, entity extraction, etc.
  • Experience with model serving infrastructure (e.g., Triton Inference Server, vLLM, TGI, FastAPI).
  • Experience with cloud ML infrastructure (AWS SageMaker, Bedrock, or equivalent).
  • Proven architectural decision-making and technical ownership across services/products.
  • Ability to break down ambiguous problems and drive actionable plans.
  • Excellent communication skills for both technical and non-technical audiences.
Nice to Have:
  • Experience with agentic system design (tool use, reasoning chains, multi-step planning).
  • Experience with self-learning/continuous improvement ML systems.
  • Multilingual NLU or cross-lingual transfer experience.
  • Familiarity with PCI/PII compliance in ML workflows.
  • Experience with experiment tracking tools (Weights & Biases, MLflow).
  • Open-source ML/NLP contributions or publications at top venues.
  • Experience with speech or multimodal LLMs.
Omilia Note- Contribute actively and effectively as an integrated team member.
- Act as an Omilia ambassador in all interactions.Benefits
  • Fixed compensation;
  • Long-term employment with the working days vacation;
  • Development in professional growth (courses, training, etc);
  • Being part of successful cutting-edge technology products that are making a global impact in the service industry;
  • Proficient and fun-to-work-with colleagues;
  • Apple gear
If you're a rare builder who speaks both code and pipeline, stays ahead of AI trends, and thrives as a cross-functional collaborator, we want to hear from you.Apply Now to join Omilia and help engineer the future of conversational AI.Omilia is proud to be an equal opportunity employer and is dedicated to fostering a diverse and inclusive workplace. We believe that embracing diversity in all its forms enriches our workplace and drives our collective success. We are committed to creating an environment where everyone feels welcomed, valued, and empowered to contribute their unique perspectives without regard to factors such as race, color, religion, gender, gender identity or expression, sexual orientation, national origin, heredity, disability, age, or veteran status, all eligible candidates will be given consideration for employment.

Omilia

Lavori simili

  • Data Scientist

    Lodestar S.p.A.

    • Milano
    Lodestar è un gruppo di aziende leader nel settore ICT, specializzato nella consulenza, nei servizi e nello sviluppo di soluzioni tecnologiche avanzate. Con oltre 500 professionist…
    • 13 ore fa
  • Data scientist

    Sell & More

    • Milano
    Sell&more è alla ricerca di un/una Data Scientist per importante società di consulenza digitale specializzata nella progettazione e implementazione di soluzioni innovative, con for…
    • 13 ore fa
  • Data scientist

    Sell & More

    • Milano
    Sell&more è alla ricerca di un/una Data Scientist per importante società di consulenza digitale specializzata nella progettazione e implementazione di soluzioni innovative, con for…
    • 7 giorni fa