Research Engineer, Safety Reasoning

OpenAI

San Francisco

Full-time

About the Team

The team is at the forefront of OpenAI's mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency.

The Safety Reasoning Research team is poised at the intersection of short-term pragmatic projects and long-term fundamental research, prioritizing rapid system development while maintaining technical robustness.

Key focus areas include improving foundational models’ ability to accurately reason about safety, values, and questions of cultural norms, refining moderation models, driving rapid policy improvements, and addressing critical societal challenges like election misinformation.

As we venture into , the team seeks talents adept in novel abuse discovery and policy iteration, aligning with our high-priority goals of multimodal moderation and ensuring digital safety.

About the Role

The role involves developing innovative machine learning techniques that push the limit of our foundation model’s safety understanding and capability.

You will engage in defining and developing realistic and impactful safety tasks that, once improved, can be integrated into OpenAI's safety systems or benefit other safety / alignment research initiatives.

Examples of safety initiatives include moderation policy enforcement, policy development using democratic input, and safety reward modeling.

You will experiment with a wide range of research techniques, including but not limited to reasoning, architecture, data, and multimodal methods.

In this role, you will:

Conduct applied research to improve the ability of foundational models to accurately reason about questions of human values, morals, ethics, and cultural norms, and apply these improved models to practical safety challenges.

Develop and refine AI moderation models to detect and mitigate known and emerging patterns of AI misuse and abuse.

Work with policy researchers to adapt and iterate on our content policies to ensure effective prevention of harmful behavior.

Contribute to research on multimodal content analysis to enhance our moderation capabilities.

Develop and improve pipelines for automated data labeling and augmentation, model training, evaluation, and deployment, including active learning processes and routines for refreshing calibration and validation data.

Design and experiment with an effective red-teaming pipeline to examine the robustness of our harm prevention systems and identify areas for future improvement.

You might thrive in this role if you:

Are excited about building safe, universally beneficial AGI and are aligned with

Possess 5+ years of research engineering experience and proficiency in Python or similar languages.

Thrive in environments involving large-scale AI systems and multimodal datasets (a plus).

Are proficient in AI safety topics such as RLHF, adversarial training, robustness, and fairness and bias (extremely advantageous).

Show enthusiasm for AI safety and dedication to enhancing the safety of cutting-edge AI models for real-world use.

