Job Posting: Rockies Research and Development Data Engineer

Position: Data Engineer


The Rockies are looking for a Data Engineer to join their Research and Development team. The successful candidate will be responsible for expanding and optimizing their data warehouse and data pipeline architecture, with a focus on collecting, cleaning, transforming, managing and validating data using distributed computing and storage systems. The goal of the Data Engineer is to democratize data, support data initiatives, ensure consistent data delivery and empower Rockies personnel to derive powerful and actionable insights.

Responsibilities and Duties:

  • Create, maintain and optimize data ETL/ELT pipelines
  • Documentation of data/pipelines
  • Ensure the ingestion of data and errors are handled without interruption
  • Process and securely store extremely sensitive data for callback and future use
  • Prepare distributed, disjoint, multi-formatted data sets for data scientists
  • Research and investigate new and interesting datasets to include in our data warehouse
  • Perform quantitative research related to baseball strategy and player evaluation
  • Collaborate with coaches, scouts and baseball operations to suggest process improvements


Education and Work Experience

  • Bachelor’s degree in Computer Science/Engineering
    • Candidates still in school (junior or senior level) with extensive work towards such degree will be considered
  • SQL knowledge and experience working with a variety of relational databases such as MySQL, PostgreSQL, or SQL Server
  • Experience with a variety of structured, semi-structured and un-structured data formats including delimited files, XML, JSON and natural language text
  • Ability to effectively use multiple programming languages including one of the major data science languages of Python, R or Scala
  • Experience or working knowledge of “Big Data” tools such as Hadoop, Hive, Spark or Presto is a plus
  • Experience with AWS Cloud services such as EC2, RDS, and S3 is a plus
  • Experience with data workflow tools such as Luigi, or Airflow is a plus
  • Knowledge and understanding of baseball and baseball statistics

Functional Skills

  • Ability to work evenings and weekends required
  • Passion for the intersection of baseball and data
  • Passion for quality data
  • Strong organizational skills and ability to self-start
  • Strong intellectual curiosity
  • Desire to learn and contribute
  • Ability to work in a collaborative and open team environment
  • Ability to develop and maintain successful working relationship with members of the Front Office

To Apply:
Qualified candidates should send their letter of interest and resume to no later than June 3, 2018.

We hoped you liked reading Job Posting: Rockies Research and Development Data Engineer by Meg Rowley!

Please support FanGraphs by becoming a member. We publish thousands of articles a year, host multiple podcasts, and have an ever growing database of baseball stats.

FanGraphs does not have a paywall. With your membership, we can continue to offer the content you've come to rely on and add to our unique baseball coverage.

Support FanGraphs

Meg is the managing editor of FanGraphs, the host of FanGraphs Audio, and the co-co-host of Effectively Wild. Her work has previously appeared at Baseball Prospectus, Lookout Landing, and Just A Bit Outside.

newest oldest most voted

I’m just curious. Do you ever get feedback from these listing agencies indicating how much value presenting this information to the Fangraphs community adds to the eventually candidate class? I’m assuming it adds something, otherwise I imagine that they wouldn’t work with you in providing the job post information. I’m just kind of curious regarding the nature of the relationship.

Kyle H
Kyle H

I’m not certain, but i think most of these were posted at and maybe thats where the poster got the information.