Research Scientist, Interpretability

2 days ago

🏢 In-office - Manhattan

Anthropic

Anthropic is an AI safety and research company working to build reliable, interpretable, and steerable AI systems.

51 - 200 employees

Description

• Develop methods for understanding LLMs by reverse engineering algorithms learned in their weights.
• Design and run robust experiments, both quickly in toy scenarios and at scale in large models.
• Build infrastructure for running experiments and visualizing results.
• Work with colleagues to communicate results internally and publicly.

Requirements

• You have a strong track record of scientific research (in any field) and have done some work on interpretability.
• You enjoy team science: working collaboratively to make big discoveries.
• You are comfortable with messy experimental science. We're inventing the field as we work, and the first textbook is years away.
• You view research and engineering as two sides of the same coin. Every team member writes code, designs and runs experiments, and interprets results.
• You can clearly articulate and discuss the motivations behind your work, and teach us about what you've learned. You like writing up and communicating your results, even when they're null.
• You are familiar with Python, which is required for this role.

Benefits

• Optional equity donation matching.
• Comprehensive health, dental, and vision insurance for you and all your dependents.
• 401(k) plan with 4% matching.
• 22 weeks of paid parental leave.
• Unlimited PTO: most staff take between 4 and 6 weeks each year, sometimes more!
• Stipends for education, home office improvements, commuting, and wellness.
• Fertility benefits via Carrot.
• Daily lunches and snacks in our office.
• Relocation support for those moving to the Bay Area.
• Private health, dental, and vision insurance for you and your dependents.
• Pension contribution (matching 4% of your salary).
• 21 weeks of paid parental leave.
• Health cash plan.
• Life insurance and income protection.
