We are looking for a Data Engineer to work on upstream R&D projects closely with a data science team, performing functional prototyping and facilitating the transfer of successful prototypes to production.
The primary focus will be on choosing optimal solutions to use for these purposes, then maintaining, implementing, and monitoring them.
You will also be responsible for integrating them with the architecture used across the company. You will be joining a growing data science team and will have the unique opportunity to help shape the future.
Assemble large, complex data sets that meet functional / non-functional business requirements and develop associated data layers that can easily be consumed by data science applications.
Partner with the data science team to prepare structured and unstructured data that they can use for predictive and prescriptive modeling
Work with data warehouse architecture team to integrate our warehouse (SQL Server) with big data technologies (e. g. Snowflake, Hadoop)
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure in collaboration with stakeholders
Maintain knowledge of emerging data science technologies
Work in a rapid prototyping area where the ability to pivot is a must
2+ years of experience with big data platforms (minimum one of the following) – Snowflake, HDFS, Hive, LLAP/Impala, and NoSQL technologies (Elasticsearch, MongoDB, HBase, Redis, Etc. )
Familiarity building stream-processing systems, using technologies such as Kafka, Storm and Spark-Streaming
2+ years of experience with analytic programming languages – Python or/ and R or/and Java
Proficiency with relational databases (T-SQL a plus)
Proficiency with ELT/ETL scheduling tools such as Airflow
Familiarity with API creation and RESTful services
Proficient with GitHub/Git, or comparable distributed version control system
Docker or container orchestration experience (Kubernetes, Mesos)
Experience working with statistics and/or modeling libraries such as Numpy or Pandas
Interest in learning new technologies and languages
Ability to be resourceful and creative in a fast-paced daily release environment
What does C. H. Robinson offer you?
Contract of employment + package of benefits (private medical care/ multi cafeteria program/annual bonus 10%)
Work office in Warsaw Spire, near to metro station Rondo Daszyńskiego
Language lessons in small groups
An opportunity to use and develop your language skills in our international work environment
Working in the new team, which we are building from scratch in Warsaw, with close cooperation with US tea
Our main priority continues to be the health and safety of our employees. Due to the COVID-19 pandemic, this position will combine remote and office work. We will continue monitoring the circumstances and adjust our approach accordingly to ensure safety.
About C. H. Robinson
From the produce you buy, to the water you drink, C. H. Robinson delivers products to people all around the globe. We are one of the world’s largest 3rd party logistic providers. Join our diverse team to innovate, solve problems, have fun and thrive.
Warsaw, Masovian Voivodeship, Polska
|Praca na stanowisku:||Data Engineer|
|Dodano:||13. 9. 2021
Praca na stanowisku - aktualna
Bądź pierwszy, który ubiega się o to miejsce pracy!