getmatch agency

AI / LLM Engineer (Local Models / RAG / API Development)

в getmatch agency

3 500 —‍ 7 000 $/‍мес на руки

📍 КипрСамостоятельный переезд
Специализация
Data Scientist & Machine Learning
Уровень
Middle
Требуемый опыт
2+ лет

Технологии/инструменты

PythonFastAPIREST APILlama / Qwen / Mistral

Our client is a leading distributor of Information and Communications Technology (ICT) products and solutions across the EMEA region. They are looking for an AI / LLM Engineer to join a team building an internal AIdriven automation platform.

About the Role

The team is developing a new internal automation platform that connects business processes, AI models, and internal systems. The goal is to build an internal AI platform powered by self-hosted LLMs, with a strong focus on data privacy and secure processing.

As an AI / LLM Engineer, you will be responsible for deploying LLMs in local or hybrid environments, building an API layer around them, and integrating these models into automation workflows (n8n) and internal tools.

You will work on model performance, RAG pipelines, and system reliability.

Responsibilities

  • Deploy and optimize local LLM models (Llama, Qwen, Mistral, etc.).
  • Build and maintain API endpoints (FastAPI / REST) for model interaction.
  • Design, implement, and maintain RAG pipelines (embeddings, chunking, vector search).
  • Work with vector databases (Chroma, Milvus, FAISS, Qdrant).
  • Integrate AI services into n8n automation workflows.
  • Tune prompts and model behavior to improve response quality.
  • Monitor model performance, latency, and resource utilization.
  • Ensure secure and private processing of internal company data.

Skills and Qualifications

  • Strong experience with Python, FastAPI, and REST APIs.
  • Hands-on experience with LLM deployment and inference.
  • Experience working with Llama / Qwen / Mistral models.
  • Practical experience with vector search and RAG architectures.
  • Understanding of API design, performance tuning, and system optimization.
  • 2+ years of experience in ML / AI engineering.

Nice to Have

  • Experience working with NVIDIA GPU servers.
  • Basic knowledge of LoRA / QLoRA fine-tuning.
  • Experience integrating AI systems with automation platforms.
  • Understanding of data privacy and secure ML workflows.

Working Conditions

  • Fixed working schedule.
  • Travel required.
  • Opportunity to work for a financially strong, fast-growing multinational company.
  • Continuous collaboration with global teams.
  • International career growth opportunities.
  • Access to professional development: training, certifications, events, team buildings.
  • Health insurance.
  • Competitive salary and motivation scheme.
  • Life event gifts, corporate awards, and long-service bonuses.

Work Format

  • Location: Cyprus.
  • Format: Office or Hybrid (3 remote days, 2 in-office days — Tuesdays and Thursdays).

Desired Start Date

  • As soon as possible.
Анна Янова рекрутер
getmatch agency

О компании getmatch agency

Сфера
Рекрутинговое агентство
Размер
11 - 50

getmatch — это рекрутинговое агентство, специализирующееся на поиске разработчиков, UI/UX-дизайнеров, продуктовых менеджеров и других IT-специалистов для технологических компаний по всему миру. Клиенты getmatch: Яндекс, Т-Банк, Мегафон, МТС, Авито, Marketfinance, Revolut, Workato, Arrival и другие.

Похожие вакансии

Зарплата скрыта, но соответствует вашей подписке
Можно удалённо из РФ
200 000 – 350 000 ₽/мес на руки
Полная удалёнка
Зарплата скрыта, но соответствует вашей подписке
📍 Москва (м. Войковская), можно удалённо из РФ
Зарплата скрыта, но соответствует вашей подписке
📍 Москва (м. Войковская), можно удалённо из РФ
250 000 – 320 000 ₽/мес на руки
Полная удалёнка