Data Extraction Lead Engineer

Newport Beach, CA, Contract

Objectives

• Participate in the architecture, design and implementation of large-scale distributed systems that extract data.
• Perform data scraping, cleansing, curation, parsing, integration, semantic mapping and enrichment.
• Create and maintain documentation and technical specs.
• Analyze and monitor datasets to ensure completeness and integrity.
• Coordinate project-related work with researchers and engineering teams.
• Manage, monitor and mentor the data pipeline effort, including agent configuration and data publishing.
• Monitor model production and testing processes; monitor dataset production; investigate and troubleshoot issues.
• Address questions from internal consumers relating to market data, examine datasets and interact with vendors.
• Compile and analyze model production statistics and produce specialized reports.

Requirements

• Exceptional academic background with a bachelor's degree or higher in Computer Science or Computer Engineering.
• Deep knowledge of computer systems, object-oriented design, data structures and algorithms.
• Experience in the software development life cycle and in developing large-scale software systems.
• Experience with tick data, fundamental data and reference data.
• Experience in programming and working in IaaS infrastructure.
• Machine Learning research and NLP experience is a great plus.
• Possess core technical skills, including the following:
   - Proficiency in at least one compiled language like Go/Rust/Scala/C++ (required)
   - Proficiency in at least one scripting language like R/Ruby/Python/V8 (required)
   - Experience with kdb+ or InfluxDB (required)
   - Experience with a modern development stack (GitLab/Docker/Kubernetes/Rancher) (desirable)
   - Database administration/programming (desirable)
   - Experience with kdb+/InfluxDB (desirable)