Search jobs > Raleigh, NC > Remote > Temporary > Sr data scientist

Sr Data Scientist - REMOTE

Simple Solutions
Raleigh, USA
Remote
Full-time

This is a remote position.

Share only 12 years Sr profiles

Senior Data Scientist II

Raleigh NC (Hybrid) Remote is also fine.

As a data scientist on our team you will work on new product development in a small team environment writing production code in both runtime and buildtime environments.

You will help propose and build datadriven solutions for highvalue customer problems by discovering extracting and modeling knowledge from largescale natural language datasets including matter and contract repository invoice / legal spend data and work management.

You will prototype new ideas collaborating with other data scientists as well as product designers data engineers frontend developers and a team of expert legal data annotators.

You will get the experience of working in a startup culture with the large datasets and many other resources of an established company.

RESPONSIBILITIES

Develop and implement LLMbased applications tailored for inhouse legal

Finetune and deploy large language models to enhance their performance on legal text processing tasks

Evaluate and help maintain our data assets and training / evaluation data sets

Design and build pipelines for preprocessing annotating and managing legal document datasets

Collaborate with legal experts to understand requirements and ensure models meet domainspecific needs

Conduct experiments and evaluate model performance to drive continuous improvements

Interface with other technical personnel or team members to finalize requirements.

Work closely with other development team members to understand moderately complex product requirements and translate them into software designs.

Successfully implement development processes coding best practices and code reviews for production environments.

REQUIREMENTS

Formal training in machine learning : dimensionality reduction clustering embeddings and sequence classification algorithms

Experience with deep learning frameworks such as PyTorch Tensorflow and Hugging Face Transformers.

Practical experience in Natural Language Processing methods and libraries such as spaCy word2vec TensorFlow Keras PyTorch Flair BERT

Practical experience with large language models prompt engineering finetuning and benchmarking using frameworks such as LangChain and LlamaIndex

Strong Python background

Knowledge of AWS GCP Azure or other cloud platform

Understanding of data modeling principles and complex data models.

Proficiency with relational and NoSQL databases as well as vector stores (e.g. Postgres Elasticsearch / OpenSearch ChromaDB)

Knowledge of Scala Spark Ray or other distributed computing systems highly preferred

Knowledge of API development containerization and machine learning deployment highly preferred

Experience with ML Ops / AI Ops highly preferred

PREFERRED QUALIFICATIONS

MS in Data Science Computer Science Statistics Machine Learning or related field

2 years of relevant work experience

Or undergraduate degree in relevant field and 4 years of relevant work experience

Senior Data Scientist II Raleigh, NC (Hybrid) Remote is also fine. As a data scientist on our team, you will work on new product development in a small team environment writing production code in both run-time and build-time environments.

You will help propose and build data-driven solutions for high-value customer problems by discovering, extracting, and modeling knowledge from large-scale natural language datasets including matter and contract repository, invoice / legal spend data and work management.

You will prototype new ideas, collaborating with other data scientists as well as product designers, data engineers, front-end developers, and a team of expert legal data annotators.

You will get the experience of working in a start-up culture with the large datasets and many other resources of an established company.

RESPONSIBILITIES Develop and implement LLM-based applications tailored for in-house legal Fine-tune and deploy large language models to enhance their performance on legal text processing tasks Evaluate and help maintain our data assets and training / evaluation data sets Design and build pipelines for preprocessing, annotating, and managing legal document datasets Collaborate with legal experts to understand requirements and ensure models meet domain-specific needs Conduct experiments and evaluate model performance to drive continuous improvements Interface with other technical personnel or team members to finalize requirements.

Work closely with other development team members to understand moderately complex product requirements and translate them into software designs.

Successfully implement development processes, coding best practices, and code reviews for production environments. REQUIREMENTS Formal training in machine learning : dimensionality reduction, clustering, embeddings, and sequence classification algorithms Experience with deep learning frameworks such as PyTorch, Tensorflow and Hugging Face Transformers.

Practical experience in Natural Language Processing methods and libraries such as spaCy, word2vec, TensorFlow, Keras, PyTorch, Flair, BERT Practical experience with large language models, prompt engineering, fine-tuning and benchmarking, using frameworks such as LangChain and LlamaIndex Strong Python background Knowledge of AWS, GCP, Azure, or other cloud platform Understanding of data modeling principles and complex data models.

Proficiency with relational and NoSQL databases as well as vector stores (e.g., Postgres, Elasticsearch / OpenSearch, ChromaDB) Knowledge of Scala, Spark, Ray, or other distributed computing systems highly preferred Knowledge of API development, containerization, and machine learning deployment highly preferred Experience with ML Ops / AI Ops highly preferred PREFERRED QUALIFICATIONS MS in Data Science, Computer Science, Statistics, Machine Learning, or related field 2+ years of relevant work experience Or undergraduate degree in relevant field and 4+ years of relevant work experience

5 days ago
Related jobs
Promoted
VirtualVocations
Durham, North Carolina
Remote

Data Scientist - RemoteKey Responsibilities:Grow user base and increase retention through machine learning and analyticsBuild machine learning models with large scale data sets to address business prioritiesDesign and influence strategies on underwriting, marketing, fraud and customer experienceRequ...

Simple Solutions
Raleigh, North Carolina
Remote

You will prototypenew ideas collaborating with other data scientists as well asproduct designers data engineers frontend developers and a team ofexpert legal data annotators. You will prototype new ideas, collaborating with otherdata scientists as well as product designers, data engineers,front-end ...

Promoted
VirtualVocations
Durham, North Carolina

A company is looking for a Senior Data Scientist. ...

Thermo Fisher Scientific
North Carolina, United States of America
Remote

Identifies meaningful insights from large data and metadata sources in support of continuous improvement efforts and business process upgrades through exploratory data analysis. ...

Promoted
VirtualVocations
Raleigh, North Carolina
Remote

Key Responsibilities:Collaborate with a team to understand data and clinical modelsAnalyze and organize data from multiple sources using various toolsCreate data science products to enhance healthcare servicesRequired Qualifications:8+ years experience as a data scientist or analyst5+ years in data ...

PNC Bank NA
Raleigh, North Carolina

Data Architecture, Data Mining, Disruptive Innovation, Information Capture, Machine Learning, Modeling: Data, Process, Events, Objects, Prototyping, Query and Database Access Tools. Directs the data gathering, data processing and data mining of large and complex datasets. Partners with Data Architec...

SynergisticIT
Raleigh, North Carolina
Remote

Currently, we are looking for entry-level software programmers, Java Full stack developers, Python/Java developers, Data analysts/ Data Scientists, Data Engineers, Machine Learning engineers for full time positions with clients. We want Data Science/Machine learning/Data Analyst and Java Full stack ...

Next Pathway
Raleigh, North Carolina
Remote

Technical familiarity with cloud technologies, enterprise data warehouse concepts, data integration pipelines and reporting. Previous experience in EDW, Datawarehouse Migration or Data Integration projects. Senior Technical Project Manager (Cloud/Data Migrations/EDW). ...

Promoted
NetApp
Durham, North Carolina

As a Software Development Engineer in Test, you will be responsible for building NetApp’s cutting-edge software defined storage solutions for major Cloud providers including AWS, Azure and GCP. You will participate in aspects of the software development lifecycle including requirements, design, impl...

Promoted
VirtualVocations
Raleigh, North Carolina

A company is looking for a Business Intelligence Analyst Developer. ...