evaluation
evaluation
Evaluating the Performance of Eight AI Chatbots in Literature Retrieval: Grok and DeepSeek Outperform ChatGPT, but None Are Perfectly Accurate
Grok and DeepSeek outperform ChatGPT in literature retrieval, yet no AI chatbot achieves fully accurate results across all tasks.
AI chatbots
literature retrieval
Grok
DeepSeek
ChatGPT
evaluation
information retrieval
16
2025-09-29
1