Public Discussion of DeepSeek Large Language Model on Twitter: A Mixed-Methods Sentiment and Topic Modeling

Authors

  • Wei Chien Ng School of Management, Universiti Sains Malaysia, Malaysia and College of Business Administration, International University of Business Agriculture and Technology, Bangladesh https://orcid.org/0000-0003-1293-6781
  • Shiwen Chen School of Management, Universiti Sains Malaysia, Malaysia
  • Quan Kai Ang School of Management, Universiti Sains Malaysia, Malaysia https://orcid.org/0009-0006-4724-4418
  • Yu Qing Soong Department of Accountancy and Business, Tunku Abdul Rahman University of Management and Technology, Malaysia https://orcid.org/0009-0002-5569-6757

DOI:

https://doi.org/10.47852/bonviewAIA62027447

Keywords:

DeepSeek, Twitter, large language model, natural language processing, sentiment analysis

Abstract

Artificial intelligence (AI) has moved from research laboratories into everyday tools used by millions worldwide. In recent years, advances in natural language AI systems have sparked extensive public exploration and discussion. This study investigates overall public sentiment and key discussion topics related to the DeepSeek large language model (LLM) on Twitter (now rebranded as X) and examines sentiment differences across discussion topics during various DeepSeek-related events. After data collection, Python was used to perform preliminary cleaning and screening of English-language tweets. The Valence Aware Dictionary and Sentiment Reasoner (VADER) sentiment analysis tool was applied to classify tweet sentiment. Based on the VADER labels, the dataset was stratified to obtain a high-quality sample of 5000 tweets while preserving the original sentiment distribution. To further explore discussion themes, latent Dirichlet allocation combined with coherence score evaluation was employed for topic modeling. Topic-level sentiment analysis was then conducted across different DeepSeek-related events to assess public attitudes toward each discussion topic. Results indicate that overall public sentiment toward DeepSeek LLM is predominantly positive. Topic modeling identified 10 optimal discussion topics, covering areas such as technical performance, economic impact, political and cultural context, and international competition. The findings also reveal significant differences in sentiment distribution across topics, demonstrating the practical value of combining sentiment analysis and topic modeling for business intelligence and AI product optimization.

 

Received: 29 August 2025 | Revised: 26 December 2026 | Accepted: 11 March 2026

 

Conflicts of Interest

The authors declare that they have no conflicts of interest to this work.

 

Data Availability Statement

The data that support the findings of this study are openly available in Kaggle at https://www.kaggle.com/datasets/bwandowando/tweets-and-reaction-on-deepseek-models and https://www.kaggle.com/datasets/datatattle/covid-19-nlp-text-classification.

 

Author Contribution Statement

Wei Chien Ng: Conceptualization, Methodology, Resources, Writing – original draft, Supervision, Project administration, Funding acquisition. Shiwen Chen: Methodology, Software, Formal analysis, Investigation, Data curation, Writing – original draft. Quan Kai Ang: Writing – review & editing, Visualization. Yu Qing Soong: Validation, Writing – review & editing.


Downloads

Published

2026-03-25

Issue

Section

Research Article

How to Cite

Ng, W. C., Chen, S., Ang, Q. K., & Soong, Y. Q. (2026). Public Discussion of DeepSeek Large Language Model on Twitter: A Mixed-Methods Sentiment and Topic Modeling. Artificial Intelligence and Applications. https://doi.org/10.47852/bonviewAIA62027447