Data Analyst II
Mass General Brigham
Mass General Brigham relies on a wide range of professionals, including doctors, nurses, business people, tech experts, researchers, and systems analysts, to advance our mission. As a not-for-profit, we support patient care, research, teaching, and community service, striving to provide exceptional care. We believe that high-performing teams drive groundbreaking medical discoveries and invite all applicants to join us and experience what it means to be part of Mass General Brigham.
Job Summary
The Senior Data Analyst will work within the Talkowski Laboratory in the Center for Genomic Medicine at Massachusetts General Hospital (MGH) and the Stanley Center for Psychiatric Research at the Broad Institute of MIT and Harvard. As a highly motivated, enthusiastic contributor, the analyst will work with our research group to detect, annotate, and characterize genetic variants in large cohorts of patients with autism and other neuropsychiatric conditions. The Talkowski Lab is a large, diverse, and interdisciplinary group of scientists dedicated to the characterization of genomic variation contributing to human disease. The group is leading and contributing to multiple international consortia that collect, process, aggregate, and analyze genomic sequencing data on a large scale. The position is part of the autism and neuropsychiatric subteam, which is part of the larger variant association team. The analyst will process incoming sequencing datasets through established pipelines, contribute to the development of new analysis workflows, and communicate effectively with other subteams to ensure that new analysis workflows are distributed across teams.
Perform high-quality, genome-scale computational analyses in a timely manner under the supervision of the group leader. Quickly learn new analytical approaches and apply and develop novel computational methods for solving complex problems. Must bring extensive practical programming experience, as well as experience with cloud computation and workflow management systems. Must also bring experience with implementing large-scale computational analyses, algorithm development, or statistical analysis. Knowledge of existing analysis tools, methods, and databases in the field of genomics is a significant plus.
This position is hybrid, with the option of flexible remote working hours.
Qualifications
PRINCIPAL DUTIES AND RESPONSIBILITIES:
- Serve as a member of the team analyzing the sequencing data for the Autism Sequencing Consortium. Assist with the development of analyses aimed at (1) calling and annotating variants in short-read sequencing data and (2) developing association frameworks to link detected variants to phenotypes.
- Perform complex data acquisition, storage, cleaning, and pre-processing.
- Perform advanced quantitative statistical analyses.
- Disseminate results via presentations and publications.
- Process incoming blended genome exome (BGE) datasets from various neuropsychiatric cohorts and call copy-number variants in these samples. BGE is a cost-effective method that combines low-pass whole genome sequencing with deep exome sequencing from a single DNA sample.
- Perform QC of results and generate reports to share with collaborators.
- Interact with other subteams, most notably the methods development team, to develop, improve, standardize, optimize, and distribute new and existing methods for genomics analyses to bridge the progress made by different subteams.
- Provide collaborative bioinformatics analysis in support of other research projects.
- Track and communicate progress to internal and external stakeholders at meetings and over Slack.
- Perform other responsibilities as needed.
SKILLS/ABILITIES/COMPETENCIES REQUIRED:
- Advanced skillset in computational biology, bioinformatics, statistics, or genomics.
- Advanced knowledge of statistical association analyses and approaches in genomics studies
- Experience in cloud-based computing preferred
- Must be able to work independently as well as part of a team in a fast-paced, highly collaborative and supportive environment
- Ability to adapt to shifting priorities in response to changing deadlines and the needs of the lab
- Excellent written and verbal communication skills
- Excellent quantitative and organizational skills
- Proven ability to work well in a collaborative environment
- Proficiency in complex data mining, database management, biostatistics, and computer programming.
- Experience with cloud computing and workflow management systems. Experience with Terra, WDL, and Google Cloud is a plus.
- Proficiency in Python, R, Unix/Linux, and/or other scripting languages.
- Proven ability to learn new computational tools and packages.
- Good foundations in statistics.
- Ability to work independently and in a team setting in an organized fashion.
- Good interpersonal and oral/written communication skills in English.
- Strong ability and experience in interpreting computational results and translating these results into biologically relevant conclusions and hypotheses.
- Experience in genomics and handling large-scale datasets is a plus.
- Project management skills and/or experience is also a plus.
LICENSES, CERTIFICATIONS, and/or REGISTRATIONS (if applicable):
N/A
EDUCATION:
Bachelor’s degree in a related field of study required. PhD strongly preferred.
EXPERIENCE:
2-5 years of experience required; experience in large-scale data analysis preferred.
SUPERVISORY RESPONSIBILITY (if applicable):
N/A
FISCAL RESPONSIBILITY (if applicable):
Prudent use of hospital resources
WORKING CONDITIONS:
Normal office conditions at Simches Research Center and Broad Institute. Periods of prolonged sitting and computer work expected. Some exposure to a research lab environment may occur. Work is conducted in on-site office space but may include remote, off-site opportunities.
Additional Job Details (if applicable)
Pay Range
$62,400.00 - $90,750.40/Annual
Grade
6
Mass General Brigham Competency Framework
At Mass General Brigham, our competency framework defines what effective leadership “looks like” by specifying which behaviors are most critical for successful performance at each job level. The framework comprises ten competencies (half People-Focused, half Performance-Focused), which are defined by observable and measurable skills and behaviors that contribute to workplace effectiveness and career success. These competencies are used to evaluate performance, make hiring decisions, identify development needs, mobilize employees across our system, and establish a strong talent pipeline.