Steven  Basart

Steven Basart

View LinkedIn

Education

broward college
university of miami
university of chicago

Previous Companies

spacexGoogleUniversity of Chicago

Companies this founder has worked for or been associated with

Interests & Hobbies

Robotics(casual)

Confidence: 89%

Other(casual)

Confidence: 88%

Location

Greater Chicago Areasan francisco bay area, californiasan francisco, california

Sample Data Sources

Showing 3 of 3+ data sources

default

Research Publications – Center for Human-Compatible Artificial Intelligence

CHAI aims to reorient the foundations of AI research toward the development of provably beneficial systems. Currently, it is not possible to specify a formula for human values in any form that we know

default

Automating expert-level medical reasoning evaluation of large language models | npj Digital MedicineClose bannerClose banner

As large language models (LLMs) become increasingly integrated into clinical decision-making, ensuring trustworthy reasoning is paramount. However, current evaluation strategies of LLMs’ medical reasoning capability either suffer from unsatisfactory assessment or poor scalability, and a rigorous benchmark remains absent. To address this, we present MedThink-Bench, a benchmark designed for rigorous and scalable assessment of LLMs’ medical reasoning. MedThink-Bench comprises 500 high-complexity questions spanning ten medical domains, accompanied by expert-authored, step-by-step rationales that elucidate intermediate reasoning processes. Further, we introduce LLM-w-Rationale, an evaluation framework that combines fine-grained rationale assessment with an LLM-as-a-Judge paradigm, enabling expert-level fidelity in evaluating reasoning quality while preserving scalability. Results show that LLM-w-Rationale correlates strongly with expert evaluation (Pearson coefficient up to 0.87) while requiring only 1.4% of the evaluation time. Overall, MedThink-Bench establishes a rigorous and scalable standard for evaluating medical reasoning in LLMs, advancing their safe and responsible deployment in clinical practice.

default

Center for AI Safety Company Profile | Management and Employees List

Find contact information for Center for AI Safety. Learn about their Government market share, competitors, and Center for AI Safety's email format.

Get Full Report

Access all 3+ data sources, detailed research, and comprehensive background information for this founder and all YC Batch W26 founders.

Book a Demo for Full Data Access