Nebius

Senior Software Engineer (Spark Over YT)

в Nebius

4 000 —‍ 8 000 €/мес на руки

📍 Амстердам (Нидерланды)Белград (Сербия)Помощь с переездом
Специализация
Java / Scala
Уровень
Senior
Команда
6 - 10 человек

Nebius AI is an AI-centric public cloud platform specifically crafted to serve AI models for training and inference.

Our mission is to help ML practitioners concentrate on their core jobs, while DevOps, MLOps, and infrastructure-related tasks are handled by us. The idea is to build an ML-specific cloud platform covering the entire ML lifecycle from A to Z: from data preparation and labeling to ML training and inference.

We recognize the potential of ML and AI technologies and aim to provide our future users with the perfect environment to train and fine-tune their models. We are committed to delivering the best user experience and excellent customer support.

  • Four development hubs: Nebius is headquartered in the Netherlands, with hubs in Finland, Serbia, and Israel.
  • Data center in Europe: Our own data center in Finland features server racks designed in-house for ML-specific high loads, with power-efficient solutions, including a free-cooling system.

  • 500+ professionals: Our mature team of Engineers has a proven track record in developing sophisticated cloud and ML solutions and designing cutting-edge hardware.

At Nebius, we're on a mission to harness the power of massive data, and we're looking for an innovative and passionate Engineer experienced in Apache Spark internals to join our team. Our platform, YTsaurus, operates with exabytes of data, and we've recently made this powerful tool open-source. You'll be at the forefront of integrating Apache Spark with YTsaurus, creating an efficient data handling ecosystem.

The unique feature of Spark over YTsaurus, or SPYT as we call it, is its deep low-level integration between Apache Spark compute and YTsaurus storage. This integration allows for efficient processing by utilizing metadata. SPYT supports YTsaurus transactions and uses knowledge of table sorting to eliminate the shuffle phase during JOIN operations. Additionally, SPYT employs YTsaurus as an execution environment for launching Spark clusters in a cloud manner, enabling dozens of SPYT clusters to operate simultaneously within YTsaurus.

For more detailed overview of SPYT, have a look at couple of talks by active and former members of YTsaurus SPYT special interest group:

SPYT is actively used by both internal and commercial users of YTsaurus in Nebius and outside our company. Also, we must define the place of Spark in our next-gen AI platform based on top of YTsaurus. To accomplish these tasks, we need a passionate Spark expert ready for challenges and not afraid of taking responsibility.

You’re welcome to work in our offices in Amsterdam and Belgrade, hybrid or remotely.

Stack

Java/Scala/KotlinApache SparkApache Hadoop

In this position, your responsibility will be to

  • Enable and optimize modern features of Apache Spark within the SPYT framework.
  • Develop new features for seamless integration between Spark and YTsaurus.
  • Provide support to both internal and external users of SPYT and YTsaurus, ensuring a smooth operation.
  • Engage with the open-source community, collaborating with big data experts to enhance YTsaurus's capabilities.
  • Advocate for SPYT as the key tool for AI applications, including training and dataset preparation.

We expect you to have

  • 5+ years of experience as a Software Engineer.
  • Practical knowledge of Scala (preferably) or other JVM-based languages (Java, Kotlin).
  • Familiarity with Apache Spark or Apache Hadoop on the user side.
  • Knowledge of classical algorithms and data structures.
  • Ability to use SSH, experience deploying your code and debugging software using logs.

It would be an added bonus if you had:

  • Experience in Apache Spark internals or even PRs for open-source upstream.
  • Background in developing distributed DBMS or computation engines.
  • Understanding of data streaming primitives and streaming computation engines: Apache Kafka, Spark Streaming, Apache Flink.
  • Experience using DataBricks, SnowFlake or Google DataProc.
  • Knowledge of YTsaurus from the user’s side.

Does this sound like the challenge you've been looking for? If so, we invite you to join us!

Екатерина Козяйкина IT Recruiter
Nebius

О компании Nebius

Сфера
Продуктовая компания
Размер
11 - 50

Nebius — современная IT-компания, помогающая создавать собственные локальные облачные платформы крупным B2B-бизнесам. Nebius предоставляет не только технологии, но и готовую к запуску бизнес-модель, включая инструменты для поддержки, продаж и маркетинга.

Похожие вакансии

280 000 – 450 000 ₽/мес на руки
📍 Любая страна, полная удалёнка, самостоятельный переезд
275 000 – 435 000 ₽/мес на руки
📍 Москва, Санкт-Петербург, Минск (Беларусь), Белград (Сербия), самостоятельный переезд
250 000 – 450 000 ₽/мес на руки
📍 Москва (м. Парк культуры), Санкт-Петербург (м. Площадь Ленина), Нижний Новгород, Минск (Беларусь), Белград (Сербия), самостоятельный переезд
250 000 – 450 000 ₽/мес на руки
📍 Москва (м. Парк культуры), Санкт-Петербург (м. Площадь Ленина), Нижний Новогород, Минск (Беларусь), Белград (Сербия), самостоятельный переезд
250 000 – 500 000 ₽/мес на руки
📍 Москва (м. Деловой центр), Санкт-Петербург (м. Лиговский проспект), Сербия, Армения, Казахстан, полная удалёнка, самостоятельный переезд