GenAI Senior Data Scientist - Data Scientist

Simple Solutions
Jacksonville, Oregon, USA
Full-time

GenAI Senior Data Scientist

There will be two interviews and then a 2-3 hour take-home coding assignment; candidates must have run their code before an interview with the team architect and then the Director.

As a Senior Data Scientist with 7-10 years of experience in LATAM, you will collaborate with the Data Science and Machine Learning team in the USA and will create data science, machine learning, and AI solutions to better address the needs of our constituents (students, alumni, faculty, researchers, staff, and community at large).

You will experiment with everything from the latest AI algorithms and techniques to blended and immersive environments, multi-modal and varied-form content, and the most innovative research and teaching methodologies.

You will be highly influential in advancing our LLM applications and guide teams towards impactful and ethical AI. We seek an expert who is eager to grow and disseminate GenAI model expertise across the organization.

In this role, you will translate the needs of our cross-functional stakeholders into user-facing applications that leverage NLP techniques and large language models (LLMs).

As a Sr. Data Scientist on our GenAI applications team, you will work on products like conversational search interfaces, chatbots, text summarizers, recommender engines, and more, based on the needs of the constituents.

You will partner with Product Managers, Machine Learning Engineers, Cloud Platform Engineers, and cross-functional partners to develop production-grade algorithms.

Duties and Responsibilities:

Architect the overall framework and infrastructure for GenAI products like search interfaces, bots, summarizers, etc. Develop and implement techniques to optimize model performance to meet specific product goals.

Collaborate closely with product management and engineering leads to align on the technical roadmap. Guide engineering teams to effectively leverage LLM capabilities in product implementations.

Establish protocols and systems for building fair, accountable, and transparent LLM-based applications. Lead efforts to proactively assess and mitigate risks due to model biases or failures. Implement robust feedback pipelines, monitoring, and corrections to ensure model safety.

Design and oversee curation of high-quality datasets tailored for LLM training for each product. Build data science pipelines covering feature generation, data visualization, and model evaluation; design the solution, build initial code, and provide documentation with ways of working to shorten time to value and maximize reusability.

Communicate clearly and effectively to technical and non-technical audiences, verbally and visually, to create understanding, engagement, and buy-in.

Contribute novel research and analyses to leading academic conferences and journals.

Identify trends and opportunities to drive innovation, both in what we do and how we do it; evaluate new data science, machine learning, and AI technologies and tools that can boost team performance, innovation, and business value.

Proactively analyze the latest developments in large language models to deeply understand model capabilities, limitations, and best practices.

Develop techniques to continually improve language understanding and model training.

Mentor and develop junior data scientists in state-of-the-art GenAI methods.

Set technical vision and lead initiatives to accelerate product impact through cutting-edge LLM innovations.

Complete other responsibilities as assigned.

Required Skills and Qualifications:

Minimum of nine years of post-secondary education or relevant work experience.

Advanced degree in mathematics, physics, computer science, engineering, statistics, or an equivalent technical discipline desired.

Minimum of three years' experience in developing machine learning models, with a track record of creating meaningful business impact and working with multiple stakeholders.

Minimum of five years' experience with Python.

Minimum of three years' experience building production NLP and deep learning models using PyTorch / TensorFlow, along with using large language model architectures (BERT, GPT-3, etc.).

Experience building advanced workflows such as retrieval-augmented generation, model chaining, dynamic prompting, PEFT / SFT, etc., using LangChain and similar tools.

Experience establishing model guardrails and developing bias detection and mitigation techniques for AI applications

Proficiency with various prompting techniques, with a clear understanding of trade-offs between prompting and fine-tuning.

Experience with fine-tuning embedding models and tuning vector databases to improve performance of semantic search and retrieval systems.

Deep understanding of underlying fundamentals, such as Transformers and self-attention mechanisms, that form the theoretical foundation of LLMs.

Experience with cloud computing platforms and tools (AWS)

Education

Master's preferred; Bachelor's required at minimum.

8 days ago