Data Scientist

Massachusetts General Hospital
Somerville, Massachusetts, US
Full-time

The Clinical Augmented Intelligence Group (CLAI) is seeking a Geospatial Data Scientist with a strong Geoinformatics and Machine Learning background to develop new spatial and temporal methods to address limitations of the multi-scale computational models used in environmental health and climate research.

Our group conducts research at the intersection of computational sciences and health. Led by Dr. Hossein Estiri, CLAI resides within the Massachusetts General Hospital (MGH) Department of Medicine.

As the largest hospital-based research program in the world, MGH enables CLAI research to leverage state-of-the-art large-scale geospatial, environmental, and clinical data for developing specialized AI / ML methods to enrich human phenotype models and exposure assessment.

This position is initially available for one year with a possible extension for up to three years based on performance evaluation.

Salary will be commensurate with the candidate's experience and qualifications.

PRINCIPAL DUTIES AND RESPONSIBILITIES

  • Participate in cutting-edge research in environmental / exposure / clinical data science and machine learning applications.
  • Work with CLAI faculty and staff as well as an interdisciplinary team of collaborators within the Mass General Brigham system and the Harvard community.
  • Develop knowledge discovery algorithms for large-scale real-world data.
  • Automate analytical data processes / workflows for advanced geospatial modeling and simulation.
  • Translate computational algorithms into parallelized and GPU-optimized code.
  • Design and implement environmental / geospatial / clinical data quality assessment procedures.
  • Adapt new procedures, methods, or instrumentation for collecting, preparing, and analyzing continental / global scale environmental exposure data.
  • Maintain relational and geospatial databases of research data.
  • Contribute to experiments with Generative AI and Large Language Models (LLMs).
  • Contribute to data filtering and curation for LLM pre-training and post-training.
  • Tabulate and visualize data for presentation at research conferences and for manuscript preparation.
  • Supervise other personnel in the laboratory to coordinate research efforts as needed.
  • Perform pertinent scientific literature reviews as needed.
  • Assist with the ordering and procurement of computational infrastructure and equipment and with general team coordination as needed.
  • Provide expertise in standardization, storage, and management of large-scale geospatial / environmental data sets.
  • Collaborate to maintain a workplace that embraces teamwork and inclusivity.

SKILLS & COMPETENCIES REQUIRED:

  • Master's Degree in Geoinformatics, Urban Planning, or a related discipline with a focus on Geospatial Science.
  • Experience in spatial and temporal methods, Geoinformatics, or Data Mining.
  • Experience in multi-scale computational models and one or more statistical / programming languages (e.g., R, C++, Python).
  • Familiarity with collaborative scientific computing and version control systems.
  • Strong technical / scientific writing, interpersonal, verbal communication, presentation, time-management, planning, problem-solving, and organizational skills.
  • Ability to work as part of a diverse team and promote collaboration and cooperation among teams.
  • Demonstrated ability to work and make decisions independently in a fast-paced academic environment.

Preferred Qualifications:

  • PhD Degree in Geoinformatics, Urban Planning, or a related discipline with a focus on Geospatial Science.
  • Relevant work experience, including full-time postdoctoral experience in an exposome research lab.
  • Fluency in domain-specific libraries (e.g., sf, terra, geopandas).
  • Experience developing and implementing large-scale data analytics pipelines on real-world data.
  • Experience in geospatial database design and implementation.
  • Experience with geostatistics and spatial interaction modeling techniques.
  • Experience with high-performance cluster computing and / or cloud computing at scale.
  • Ability to design and execute a research agenda.
  • Experience with applying Generative AI model specialization techniques (e.g., SFT, RLHF).
  • Strong publication record.
  • Experience with software engineering and developing user-friendly interfaces.
  • Experience with open science practices and data management tools that facilitate reproducible science (e.g., Posit Cloud, Google Colab).
  • Experience with informatics / medical ontologies.

Massachusetts General Hospital is an Affirmative Action Employer. By embracing diverse skills, perspectives and ideas, we choose to lead.

All qualified applicants will receive consideration for employment without regard to race, color, religious creed, national origin, sex, age, gender identity, disability, sexual orientation, military service, genetic information, and / or other status protected under law.

We will ensure that all individuals with a disability are provided a reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment.
