Senior Data Engineer
Mass General Brigham
Mass General Brigham relies on a wide range of professionals, including doctors, nurses, business people, tech experts, researchers, and systems analysts to advance our mission. As a not-for-profit, we support patient care, research, teaching, and community service, striving to provide exceptional care. We believe that high-performing teams drive groundbreaking medical discoveries and invite all applicants to join us and experience what it means to be part of Mass General Brigham.
The CHoRUS for Clinical Care AI program includes leaders in the field applying AI to medical and health signals, including experts from the ODHSI, Physionet/MIMIC, and diverse Critical Care and Neuroscience Communities.
The MGB NeuroAI program supports machine learning and AI through large-scale multimodal data curation and analysis.
Job Summary
SummaryResponsible for designing, developing, and maintaining the data architecture and infrastructure within an organization. This position plays a crucial role in managing large-scale data systems and ensuring the efficient flow, storage, and accessibility of data for various stakeholders, such as data analysts, data scientists, and business users.
Does this position require Patient Care?
No
Essential Functions
-Design, develop, and implement data pipelines and ETL/ELT code to support business requirements.
-Work on cross-functional teams delivering enterprise solutions for internal and external clients.
-Assume ownership for delivering code revisions and enhancements from design through development and production installation.
-Maintain and optimize various components of the data pipeline architecture.
-Become subject matter expert for internal and external data products Ensure design solutions can scale and meet technical standards and performance benchmarks.
-Identify inefficient processes and develop recommendations and design solutions.
-Lead code review sessions to validate technical solutions and facilitate knowledge sharing.
Qualifications
Education
Bachelor's Degree Related Field of Study required
Can this role accept experience in lieu of a degree?
Yes
Licenses and Credentials
Experience
Experience in data engineering, with a focus on building and maintaining data infrastructure and pipelines 5-7 years required and Data warehousing development in large reporting environments 3-5 years required and Experience working with developing data pipelines using on Snowflake features ( Snowpipe, SnowSQL, Snow Sight, Data Streams ) required and Hands-on development experience with ETL/ELT tools, such as dbt, Fivetran, or Informatic required and Experience working in Agile software development environment required
Knowledge, Skills and Abilities
- Working knowledge of cloud computing platforms such as AWS, GCP, or Azure.
- Experience with enterprise database solutions in cloud or on-premise environments Adherence to sound engineering principles and practices when designing technical solutions.
Additional Job Details (if applicable)
Licenses/Certifications
Required:
- Snowflake SnowPro Core Certification (or higher).
- AWS/Azure/GCP Data Engineering Certification (e.g., AWS Certified Data Analytics, Azure Data Engineer Associate).
Preferred:
- Federated Learning
- Running jobs on high-performance computing servers
- Healthcare-specific certifications (e.g., HL7 FHIR Certification, Certified Health Data Analyst (CHDA)).
- Security certifications (e.g., CISSP, CIPP) for handling sensitive clinical data.
Work Experience
- 3+ years of experience in data engineering, with 2+ years focused on healthcare/clinical data (e.g., hospitals, EMR systems, clinical trials).
- Experience with OMOP CDM, Epic/Cerner EHR systems, or clinical data lakes.
Knowledge, Skills and Abilities
- Advanced proficiency in Snowflake (Snowpipe, Time Travel, Zero-Copy Cloning) and SQL for complex transformations.
- Strong programming skills in Python/Scala (Pandas, PySpark) for data scripting and automation.
- Hands-on experience with ETL/ELT tools (Apache Spark, AWS Glue, Azure Data Factory) and cloud platforms (AWS, Azure, GCP).
- Familiarity with healthcare data formats (OMOP, FHIR, HL7, DICOM) and clinical workflows.
- Expertise in federated learning and running large jobs on high-performance computing servers is a plus.
Remote Type
Work Location
Scheduled Weekly Hours
Employee Type
Work Shift
EEO Statement:
Mass General Brigham Competency Framework
At Mass General Brigham, our competency framework defines what effective leadership “looks like” by specifying which behaviors are most critical for successful performance at each job level. The framework is comprised of ten competencies (half People-Focused, half Performance-Focused) and are defined by observable and measurable skills and behaviors that contribute to workplace effectiveness and career success. These competencies are used to evaluate performance, make hiring decisions, identify development needs, mobilize employees across our system, and establish a strong talent pipeline.