Generalist, Responsible Scaling Team
Company: Lionheart Ventures
Location: San Francisco
Posted on: October 16, 2024
Job Description:
Anthropic's Responsible Scaling Policy
Last summer we published
our first Responsible Scaling Policy (RSP), which focuses on
addressing catastrophic safety failures and misuse. In adopting
such a policy, our primary goal has been to help turn high-level
safety concepts into practical policies for fast-moving technical
organizations and demonstrate the viability of these measures as
possible standards.

Our Responsible Scaling Policy has been a
powerful rallying point, with many teams' work over the last six
months connecting directly back to major RSP workstreams. The
progress we have made has required significant work from teams
across Anthropic, and there is much more work to be done. Our new
Responsible Scaling Team will:
- Help leadership align on a practical approach to scaling
responsibly that will raise the safety waterline in industry,
inform regulation, and mitigate catastrophic risks from models
- Rally teams internally to operationalize and implement this
technical roadmap and set of high-level commitments, making
object-level decisions as needed
- Iterate internally on different approaches to safety
challenges, feeding these learnings back into the high-level
policy, and sharing our learnings with industry and policymakers

As
we continue to iterate on and improve the original policy, we are
actively exploring ways to incorporate practices from existing risk
management and operational safety domains. While none of these
domains alone will be perfectly analogous, we expect to find
valuable insights from nuclear security, biosecurity, systems
safety, autonomous vehicles, aerospace, and cybersecurity. We
intend to build an interdisciplinary team to help us integrate the
most relevant and valuable practices from each.

Note: For this role,
we are looking for candidates who can start within 3 months. We
will consider all candidates who can meet the organization's hybrid
policy, provided they have significant (60%+) overlap with Pacific
Time.

About this Team
We are a rapidly growing, multidisciplinary
team working to operationalize Anthropic's Responsible Scaling
Policy (RSP) and implement cutting-edge safety measures across the
organization. Our work spans threat modeling and AI safety
evaluations, technical and operational risk mitigation, and
governance frameworks for responsible AI development. We are
looking for exceptional individuals with a passion for AI safety
and a track record of driving impactful change. As our team
continues to expand, we anticipate a range of opportunities across
our core RSP workstreams:
- Capability Evaluations: Designing and executing rigorous testing
frameworks to proactively identify and assess potential risks in
Anthropic's AI systems
- Safety Mitigations: Translating AI safety principles into
practical, implementable measures and driving the development and
rollout of cutting-edge safeguards
- Risk Management & Governance: Providing independent oversight
and assurance to ensure Anthropic's RSP commitments are upheld with
the highest degree of integrity

We are not currently hiring
Generalists, as we are focused on hiring leads for each of these
three workstreams: Safety Case Specialist (Capability
Evaluations), Safety Case Specialist (Safety Mitigations), and Risk
Manager. However, we are always eager to connect with talented
individuals who share our mission and values. If you have a strong
technical background, exceptional project management or systems
engineering skills, and a demonstrated commitment to AI safety, we
would love to hear from you. By submitting an application here,
you'll be among the first to know about new opportunities on the
Responsible Scaling Team as they become available.