Site Reliability Engineer - Docker

New York, Full Time

Required Skills

Ansible, AWS, Chef, Linux, Puppet, Python, API, Architect, Docker, Elasticsearch, Hadoop, HBase, Hive, Infrastructure, Kafka, Kubernetes, Management, MySQL, OOP, Pig, Postgres, Production, QA, Ruby, Security, Warehousing, YARN, ZooKeeper
The SRE team works on coding, automating, and increasing the availability, reliability, and performance of the company's internal and external services.

What you'll do:

Architect and manage our container infrastructure
Work with developers to build a deploy pipeline for rapid development and QA
Integrate our container efforts with our non-container infrastructure to deliver production data
Develop the systems that empower data gathering in the Deep & Dark Web
Ensure our systems are available, scalable, and monitored
Focus on internal tooling, automation, data warehousing, and security
Test and tune performance issues across components and services

REQUIREMENTS

Who you are:

Deep experience in containerization, especially in production environments
Skilled in either Python or Ruby (OOP experience a huge plus!)
Willingness to learn, teach, and review code
Strong background in Linux
Previous experience with, and responsibility for, critical and complex systems
Experience with config management systems (Ansible, Chef, Puppet, Salt)
Experience with AWS or GCE (API usage a plus!)

Tools we like (experience with the following or similar is a plus, not a requirement):

Containerization (Docker, Kubernetes)
ZooKeeper, etcd
Hadoop (HDFS, YARN, Pig, Hive)
Postgres, MySQL/MariaDB, Elasticsearch, HBase
Kafka
Metrics (OpenTSDB)
Monitoring (Icinga2)
Logging (Kibana, Logstash)