📍 New York (USA), fully remote
The PulsePoint platform is powered by terabytes of impression-level data, allowing brands to efficiently engage the right audiences at scale while helping publishers increase yield through actionable insights.
As part of the SRE team, you will be expected to grow your technical knowledge and to challenge your fellow team members, who will challenge you in return.
Time zone
(GMT-04:00) New York.
Line manager
SRE Manager.
Why we recommend this role
- The PulsePoint platform is powered by terabytes of impression-level data.
- Ability to work with modern technologies.
Responsibilities
- Ensure the reliability and scalability of our multi-datacenter and hybrid Linux environments.
- Manage the large-scale Linux infrastructure to ensure maximum uptime.
- Perform performance and reliability testing, which may include reviewing configuration, software choices and versions, and hardware specs.
- Advance our technology stack with innovative ideas and creative new solutions.
- Participate in capacity management of core systems and services, application analysis and performance, and security tuning. Provide operational support for systems and build automation to remediate issues and address their root causes, with the goal of automating the response to all non-exceptional service conditions.
- Create strategies for long-term, permanent fixes to critical production incidents.
- Maintain documentation, build tooling, and create alerts to both identify and address infrastructure reliability issues.
- Proactively identify system anomalies.
Requirements
- Thorough understanding of Linux (CentOS in production).
- Deep understanding of the Puppet configuration management tool (experience with Chef, CFEngine, or Salt is also acceptable).
- You know what git is and can easily resolve a merge conflict.
- Advanced knowledge of K8s and its ecosystem.
- Experience administering SQL/NoSQL databases (MySQL, PostgreSQL, MongoDB, Elasticsearch).
- Experience with scalable infrastructure monitoring solutions such as Icinga, Prometheus, ELK.
- Proficiency in at least one scripting language (Python, Ruby, Shell, etc.).
- Understanding of basic networking concepts (TCP/IP stack, DNS, CDN, load balancing).
Company offers
- Work schedule aligned with New York time: 4 pm to 1 am Moscow time.
- US holiday schedule.
- 21 days of vacation.
- You will work as a contractor.
Hiring process
The process has four stages:
- HR interview in English.
- Phone screening.
- Tech interview with Lead and Senior Engineers covering Linux and the architecture of modern infrastructure.
- Interview with the CTO.
Петр Кузин, Tech Recruiter