AI chatbots still can’t accurately answer high-level history questions: study
While artificial intelligence excels at tasks like coding and podcast generation, it struggles to accurately answer high-level history questions, according to a study. Researchers tested OpenAI’s GPT-4, Meta’s Llama and Google’s Gemini using a newly developed benchmark called Hist-LLM. The benchmark relies on the Seshat Global History Databank, a comprehensive database of historical knowledge. Artificial…