Research Engineer, Safety Reasoning

OpenAI
San Francisco
Full-time

About the Team

The team is at the forefront of OpenAI's mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency.

The Safety Reasoning Research team is poised at the intersection of short-term pragmatic projects and long-term fundamental research, prioritizing rapid system development while maintaining technical robustness.

Key focus areas include improving foundational models’ ability to accurately reason about safety, values, and questions of cultural norms, refining moderation models, driving rapid policy improvements, and addressing critical societal challenges like election misinformation.

As we venture into , the team seeks talents adept in novel abuse discovery and policy iteration, aligning with our high-priority goals of multimodal moderation and ensuring digital safety.

About the Role

The role involves developing innovative machine learning techniques that push the limit of our foundation model’s safety understanding and capability.

You will engage in defining and developing realistic and impactful safety tasks that, once improved, can be integrated into OpenAI's safety systems or benefit other safety / alignment research initiatives.

Examples of safety initiatives include moderation policy enforcement, policy development using democratic input, and safety reward modeling.

You will experiment with a wide range of research techniques, including but not limited to reasoning, architecture, data, and multimodal methods.

In this role, you will:

Conduct applied research to improve the ability of foundational models to accurately reason about questions of human values, morals, ethics, and cultural norms, and apply these improved models to practical safety challenges.

Develop and refine AI moderation models to detect and mitigate known and emerging patterns of AI misuse and abuse.

Work with policy researchers to adapt and iterate on our content policies to ensure effective prevention of harmful behavior.

Contribute to research on multimodal content analysis to enhance our moderation capabilities.

Develop and improve pipelines for automated data labeling and augmentation, model training, evaluation, and deployment, including active learning processes and routines for calibration and validation data refresh.

Experiment with and design an effective red-teaming pipeline to examine the robustness of our harm prevention systems and identify areas for future improvement.

You might thrive in this role if you:

Are excited about of building safe, universally beneficial AGI and are aligned with

Possess 5+ years of research engineering experience and proficiency in Python or similar languages.

Thrive in environments involving large-scale AI systems and multimodal datasets (a plus).

Exhibit proficiency in AI safety topics such as RLHF, adversarial training, robustness, and fairness and bias (extremely advantageous).

Show enthusiasm for AI safety and dedication to enhancing the safety of cutting-edge AI models for real-world use.

30+ days ago