OpenAI
Company Blog
GPT developer, leading AI lab

Credibility Rating: 4/5 (High). High quality: established institution or organization with editorial oversight and accountability.
Resources: 104
Citing pages: 117
Tracked domains: 1
Tracked Domains: openai.com
Resources (104)
| Title | Type | Authors | Date | Summary | Citations |
|---|---|---|---|---|---|
| OpenAI Official Homepage | web | — | — | S | 24 |
| OpenAI Preparedness Framework | web | — | — | S | 19 |
| OpenAI: Model Behavior | paper | Rakshith Purushothaman | 2025-01-01 | S | 15 |
| OpenAI Safety Updates | web | — | — | S | 13 |
| GPT-4 System Card | web | — | — | S | 8 |
| Weak-to-strong generalization | web | — | — | S | 7 |
| OpenAI Preparedness Framework | web | — | — | S | 6 |
| Preparedness Framework | web | — | — | S | 6 |
| GPT-4 Technical Report and Research Overview | web | — | — | S | 5 |
| announced December 2024 | web | — | — | S | 5 |
| Safety & responsibility | web | — | — | S | 4 |
| Introducing Superalignment | web | — | — | S | 4 |
| OpenAI Superalignment Fast Grants | web | — | — | S | 4 |
| 2025 OpenAI-Anthropic joint evaluation | web | — | — | S | 4 |
| SWE-bench Verified - OpenAI | web | — | — | S | 4 |
| OpenAI - How We Think About Safety Alignment | web | — | — | S | 3 |
| OpenAI's alignment research | web | — | — | S | 3 |
| Learning to Reason with LLMs: OpenAI o1 | web | — | — | S | 3 |
| OpenAI CoT Monitoring | web | — | — | S | 3 |
| OpenAI Usage Policies | web | — | — | S | 3 |
| OpenAI: Preparedness Framework Version 2 | web | — | — | S | 3 |
| Deliberative alignment: reasoning enables safer language models | web | — | — | S | 3 |
| Extracting Concepts from GPT-4 | web | — | — | S | 3 |
| OpenAI on detection limits | web | — | — | S | 2 |
| Learning to summarize with human feedback | web | — | — | S | 2 |
Page 1 of 5
Citing Pages (117)
AI Accident Risk Cruxes, Agentic AI, AGI Development, AGI Timeline, AI-Assisted Alignment, AI Alignment, Alignment Evaluations, Apollo Research, Alignment Research Center, Authentication Collapse, Bioweapons Risk, AI Uplift Assessment Model, Center for AI Safety, Capabilities-to-Safety Pipeline Model, Capability Elicitation, AI Capability Threshold Model, Autonomous Coding, Compute Concentration, AI-Driven Concentration of Power, Constitutional AI, Corporate AI Safety Responses, Corrigibility Failure, AI Risk Critical Uncertainties Model, Cyberweapons Risk, Dangerous Capability Evaluations, Deceptive Alignment, Deep Learning Revolution Era, AI Safety Defense in Depth Model, AI Disinformation, Elicit (AI Research Tool), Emergent Capabilities, Epistemic Sycophancy, EU AI Act, Eval Saturation & The Evals Gap, AI Evaluations, Evals-Based Deployment Gates, AI Evaluation, Goal Misgeneralization Probability Model, AI Governance and Policy, Heavy Scaffolding / Agentic Systems, AI-Human Hybrid Systems, Instrumental Convergence, Instrumental Convergence Framework, Is Interpretability Sufficient for Safety?, AI Safety Intervention Effectiveness Matrix, AI Safety Intervention Portfolio, Jan Leike, AI Knowledge Monopoly, AI Lab Safety Culture, Large Language Models, AI Value Lock-in, Long-Horizon Autonomous Tasks, Mesa-Optimization, Mesa-Optimization Risk Analysis, Metaculus, METR, Minimal Scaffolding, AI Misuse Risk Cruxes, Third-Party Model Auditing, Multipolar Trap Dynamics Model, OpenAI, OpenAI Foundation, Optimistic Alignment Worldview, Paul Christiano, Should We Pause AI Development?, Persuasion and Social Manipulation, Process Supervision, AI Proliferation, AI Proliferation Risk Model, AI Development Racing Dynamics, Racing Dynamics Impact Model, Reasoning and Planning, Red Teaming, Reducing Hallucinations in AI-Generated Wiki Content, AI Alignment Research Agendas, Reward Hacking, Reward Hacking Taxonomy and Severity Model, AI Risk Activation Timeline Model, AI Risk Cascade Pathways Model, AI Risk Interaction Network Model, RLHF, Safety-Capability Tradeoff Model, AI Safety Cases, AI Safety Research Allocation Model, AI Safety Research Value Model, AI Safety Researcher Gap Model, Sam Altman, AI Capability Sandbagging, Sandboxing / Containment, Scalable Eval Approaches, Scalable Oversight, Is Scaling All You Need?, AI Scaling Laws, Scheming, Scheming & Deception Detection, Scheming Likelihood Assessment, Self-Improvement and Recursive Enhancement, Sharp Left Turn, Situational Awareness, Sleeper Agent Detection, AI Safety Solution Cruxes, Sparse Autoencoders (SAEs), AI Model Steganography, Superintelligence, Sycophancy, AI Safety Technical Pathway Decomposition, Technical AI Safety Research, Compute Thresholds, Tool Use and Computer Use, Treacherous Turn, Voluntary AI Safety Commitments, AI Risk Warning Signs Model, Weak-to-Strong Generalization, Why Alignment Might Be Hard, World Models + Planning, Worldview-Intervention Mapping
Publication ID: openai