HealthBench Does Not Evaluate Patient Safety. HealthBench is a recently released benchmark to evaluate large language models in healthcare. This blog post summarizes what HealthBench… (Published in Data Science Collective, May 13, 5 responses)
AI chatbots did not “defeat” doctors at diagnosing illness. A recent New York Times article titled "AI Chatbots Defeated Doctors at Diagnosing Illness" covers a study recently published in JAMA Network… (Nov 20, 2024, 13 responses)
Human and Artificial General Intelligence Arises from Next Token Prediction. What if human intelligence derives from successful next token prediction, and what if next token prediction is a sufficient objective… (Published in TDS Archive, Apr 28, 2024, 9 responses)
Is “Good” AI Harder than “Bad” AI? Many definitions of AI alignment describe a goal of “making AI systems follow human values” while attempting to skirt difficult moral… (Published in AI Advances, Mar 29, 2024)
AI Alignment and Moral Philosophy for Artificial General Intelligence. In this post I summarize a few of my thoughts on alignment and moral philosophy for safe artificial general intelligence. (Published in AI Advances, Mar 19, 2024, 12 responses)
ChatGPT Is Not a Doctor. Hidden dangers in seeking medical advice from LLMs. (Published in TDS Archive, Feb 23, 2024, 6 responses)
Hands in Human Dreams Look AI-Generated. It’s fun to see the surprising images of human hands produced by generative image models. Why are there so many fingers, or so few? It… (Feb 6, 2024, 1 response)
Job Loss from AI. Labor Market Impact of Large Language Models. (Published in AI Advances, Jan 29, 2024, 2 responses)
Just for Fun: Llama2 Chats with ChatGPT3.5. I decided to have Llama2 and ChatGPT3.5 chat with each other just for fun. Llama2 is available here and ChatGPT3.5 here. I typed the first… (Nov 29, 2023)