Harnessing Technology for Pandemic Support: Develop-ing a COVID-19 Chatbot in Vietnam
Exploiter la technologie pour soutenir la pandémie : développer un chatbot COVID-19 au Vietnam
Vinh Nguyen-Nhu1,2, Tho Thanh Quan3, Thien Duc Duong1, Son Trung Huynh1, Ngan Thi Kim Le1, Duy Phung Quoc Vo1, Liem Thanh Nguyen1, Bao Pham Gia Nguyen1, Hue Thi Hong Nguyen1, Phuc Hoang Vo1, Tung Ngoc Nguyen1, Chinh Quang Doan3, Minh Tran Cong Bui3, Sy Duong-Quy4
1
: Faculty of Medicine, University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City, Vietnam
2
: Department of Respiratory Functional Exploration, University Medical Center, Ho Chi Minh City, Vietnam
3
: Ho Chi Minh City University of Technology (HCMUT), VNU-HCM, Ho Chi Minh City, Vietnam
4
: Bio-Medical Research Center, Lam Dong Medical College, Da Lat, Viet Nam
Corresponding author:
Vinh Nguyen–Nhu. University Medical Center, University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City, Vietnam
E-mail: vinhnguyenmd@ump.edu.vn
ABSTRACT
Background. In the context of limited human resources during a pandemic, chatbots serve as vital tools for disseminating essential information. This study aimed to outline the design process of a chatbot tailored to the public's demand for time-ly and visually accessible COVID-19 information in Vietnam. Methods. Task-Oriented Dialogue Systems were used to build the Chatbot. After retrieving data according to the key label, Chatbot's content was curated and managed using a NoSQL database and MongoDB database management system. We incorporated Support Vector Machine, Named Entity Recognition and K – nearest neighbor to train the model. Results. The developed database comprises 12 modules contain-ing over 1000 sets of questions and answers, addressing prevalent COVID-19 concerns. The Chatbot dataset was strictly reviewed and validated to ensure its accuracy and suitability for community education to be scientifically accurate and appropriate for community education. A preliminary survey involving 245 medical students revealed encouraging poten-tial for the application of the Chatbot. Conclusion. We have initially constructed a chatbot containing comprehensive con-tent pertaining to COVID-19. Despite limitations in financial and human resources, our Chatbot serves as a foundation for future research aimed at refining this tool to meet the demands for medical information search.
KEYWORDS: COVID-19; Chatbot; SARS-CoV-2; Pandemic; Vietnam.
RÉSUMÉ
Introduction. Dans le contexte de ressources humaines limitées en période de pandémie, les chatbots constituent des outils essentiels pour diffuser des informations essentielles. Cette étude visait à décrire le processus de conception d'un chatbot adapté à la demande du public d'informations COVID-19 opportunes et visuellement accessibles au Vietnam. Méthodes. Des systèmes de dialogue orientés tâches ont été utilisés pour créer le Chatbot. Après avoir récupéré les données selon l'éti-quette clé, le contenu de Chatbot a été organisé et géré à l'aide d'une base de données NoSQL et d'un système de gestion de base de données MongoDB. Nous avons incorporé Support Vector Machine, Named Entity Recognition et K – le plus proche voisin pour entraîner le modèle. Résultats. La base de données développée comprend 12 modules contenant plus de 1 000 ensembles de questions et réponses, répondant aux préoccupations courantes liées au COVID-19. L'ensemble de données Chatbot a été strictement examiné et validé pour garantir son exactitude et son adéquation à l'éducation commu-nautaire afin qu'il soit scientifiquement précis et approprié pour l'éducation communautaire. Une enquête préliminaire auprès de 245 étudiants en médecine a révélé un potentiel encourageant pour l’application du Chatbot. Conclusion. Nous avons initialement construit un chatbot contenant un contenu complet relatif au COVID-19. Malgré les limites des res-sources financières et humaines, notre Chatbot sert de base à de futures recherches visant à affiner cet outil pour répondre aux demandes de recherche d'informations médicales.
MOTS CLÉS: COVID-19; Chatbot; SARS-CoV-2; Pandémie; Vietnam.
INTRODUCTION
COVID-19 is an infectious disease caused by the SARS-CoV-2 virus, which has been recognized as a global pandemic by the World Health Organization (WHO). COVID-19 infection can progress severely and re-quires medical care, especially in individuals with un-derlying conditions such as heart disease, diabetes, chronic lung disease, or cancer. Early intervention in the community by providing real-time, evidence-based information for people who are seeking help for their COVID-19 knowledge and information is essen-tial to protect infectious patients and reduce the bur-den on the healthcare system, which is always over-loaded everywhere during the pandemic. Studies proved that understanding the disease and how the virus spreads is the best way to prevent and slow the infection rate [2]. Given the limited human resources and the vast amount of information and knowledge related to COVID-19 that surpasses the consultant’s capacity to respond to information seekers in a brief time, a chatbot may serve as an alternative method to
address this issue. The fact that chatbots can offer rap-id and pertinent information regarding disease and necessary local information may contribute to the re-duction of mortality rates and expenses associated with COVID-19 treatment.
In January 2020, following the declaration of the novel coronavirus as a pandemic, the WHO, Centers for Dis-ease Control and Prevention (CDC) and United Na-tions Children's Fund (UNICEF) utilized chatbots to disseminate information about COVID-19 [1-4]. These chatbots provide up-to-date news and information about the virus, including symptoms of infection and measures individuals may take to protect themselves and others; the information is presented in various formats such as text, video, and graphics. Additional chatbots have also been developed to assist communi-ties in alleviating the impact of COVID-19, such as "COVID-19 Preventable" in Thailand and "COOPERA"
in Japan [5,6].
In Vietnam, the Information Technology Department -Ministry of Health launched the “Q&A Chatbot about COVID-19” on their electronic information page in 20207 This chatbot provides the latest comprehensive information on COVID-19, including government di-rectives, disease prevention and control measures by the Ministry of Health. Updated statistics on the situa-tion, the number of domestic and international pa-tients, the origin and mechanism of transmission of the virus, infection symptoms, and a self-assess infection scoring system for infection are also offered. Addition-ally, the chatbot includes a list of healthcare facilities nationwide that are qualified to receive and treat COVID-19 patients and synthesizes news on COVID-19 [7].
Nevertheless, the interaction between this chatbot and users is limited to text messages exclusively, without the capability for image or video formats. This limitation may cause confusion for users in cer-tain situations, such as when dealing with medical devices or following breathing exercise instructions. Furthermore, the data available for this chatbot is still limited in terms of its comprehensiveness and diversity. Therefore, this research aims to develop the design process of a chatbot that caters to the pub-lic’s need for prompt and visually accessible infor-mation regarding COVID-19, provides continuous updates from various authoritative sources and is suitable for the social context of Vietnam.
METHODS
The Chatbot model is collaboratively developed by an integrated team of lecturers and students from the Ho Chi Minh City University of Technology, fo-cusing primarily on software design, and from the University of Medicine and Pharmacy at Ho Chi Minh City, being responsible for designing the COVID-19 dataset. The process is conducted through several steps:Determining the modelTo establish a highly efficient and effective interac-tive conversation between users and computers, many factors are needed. These include compre-hending user intents, providing prompt responses, and, most significantly, possessing a system capable of fulfilling these needs. Among the systems for building chatbots, Task-Oriented Dialogue Systems (TODS) have many advantages compared to other systems and can meet the mentioned requirements.8 Therefore, the TODS was chosen as the framework to build the COVID-19 Chatbot. TODS employs two methodologies: the pipeline and the end-to-end. Pipeline is an approach that consists of multiple sub-modules with different tasks, and the system will operate by passing data through each module. For the end-to-end, only one module is responsible for processing user information. The pipeline approach is suitable for constructing the COVID-19 Chatbot (Figure 1)
The structure of a pipeline-based TODS system consists of four main components:
•Natural Language Understanding Block (NLU): When the user's text message is inputted, the NLU functions to analyze and present it in pre-defined structured data. In Chatbot soft-ware, the structured data format includes Intents and Entities, and each Chatbot will have its specific settings.
•Dialogue State Tracking Block (DST): This block is responsible for monitoring and updating the state of the conversation based on the tracked dialogue state.
•Dialogue Policy Block (DP): Based on the conversation state from the Dialogue State Tracking block, the DP will determine the next action of the system.
•Natural Language Generation Block (NLG): Based on the ac-tion determined by the Dialogue Policy block, the Natural Lan-guage Generation block is responsible for providing a response to the user in natural language.
Building up content
MongoDB, a NoSQL database system was used as database management system to build up Chatbot's content. The data were structured as question-answer pairs. In the former part, questions were for-mulated based on aspects and concerns of the public about COVID-19. These sources included social me-dia, articles, television, live broadcasts, textbooks, courses on COVID-19, and guidance documents pro-vided by the Ministry of Health and reputable medi-cal associations.
After constructing the set of questions, correspond-ing answers were generated based on reliable and reputable sources. This was done to ensure that each answer fulfilled the criteria of being correct, com-plete, and easy to understand. In this way, the initial set of questions and answers was formed. The up-coming dataset was then edited, reviewed, tested, and finalized to create the most comprehensive da-taset. A dataset of over 1000 questions and answers has been established and will be consistently updated and expanded as needed. The data are categorized into 12 modules covering the corre-sponding issues related to COVID-19, as listed below:
General information and epidemiology: The chatbot pro-vided initial, basic information about COVID-19 and a more detailed understanding of this pandemic. Additionally, information regarding anticipating public concerns, variants of the virus and the mecha-nisms of transmission was also provided.
State regulations: The state regulations are flexible in response to epidemic situations. Therefore, the data in this part was updated to align with the “new situ-ation” state of the whole country. The data issourced from official information in the most recent documents from the Ministry of Health, Ho Chi Minh City Department of Health, and the Ho Chi Minh City Center for Disease Control (HCDC), fo-cusing on isolation regulations and other policy regulations.
Prevention measures: One of the contributing factors in preventing a pandemic is the effective implemen-tation of preventive measures to control the spread of the virus. This is also one of the most concerning and searched-for topics during the pandemic. Hence, constructing a chatbot scenario based on this is ap-propriate and highly necessary. This module focuses on developing content for this scenario around the Vietnamese “5K model” (Khau trang (facemask)-(Khu khuan) disinfection- (Khoang cach) distance-(Khong tu tap) no gathering- (Khai bao y te) health declaration).
Health declaration and contact information: In relation to this module, Chatbot provides users with the lat-est addresses and information sourced from the Min-istry of Health regarding the regulations governing healthcare for insured patients. Furthermore, users are provided with district-level government/healthcare contact phone numbers and guidance on obtaining oxygen. These vital pieces of information become extremely crucial for everyone during this outbreak, especially in severe situations.
COVID-19 symptoms: It is crucial to monitor symp-toms to prevent severe cases, hospitalizations, and fatalities. Citizens should be able to recognize signs of deterioration to use appropriate medication and promptly seek medical assistance. A dataset was created to help users with inquiries regarding mild, moderate, and severe symptoms of COVID-19 and underlying health conditions that pose a risk for se-vere progression of COVID-19.
COVID-19 testing: The demand for testing is continu-ously increasing, and testing is not only conducted at healthcare centers but also at homes. This leads to a need for information on how to perform the tests and interpret the results. Chatbot's testing scenario emphasizes the provision of essential information. The current dataset will focus on rapid antigen test-ing and RT-PCR testing, the two most interesting categories to the community. For each category, questions based on key components were formulat-ed according to characteristics, target groups, loca-tions, and result interpretation.
COVID-19 diagnosis: In the current situation of the pandemic, with a considerable risk of close contact, users will have many concerns about whether they have been infected with COVID-19. Therefore, the issue of diagnosis is crucial. To optimize responses for users, the diagnosis module incorporates two-way communication between the Chatbot and users to gather comprehensive information and provide the most suitable risk assessment and guidance. The interaction can also be seen between those with symptoms and users without symptoms, as the diag-nosis module will be developed in two directions for the Chatbot to provide them with answers. The diag-nosis tree was developed to solve this issue based on initial symptoms reported by individuals to suggest the likelihood or risk of infection.
Symptomatic users will be divided into two streams: Those who have been tested and those who have not. The former group will be further categorized according to the type of test, i.e., rapid antigen, or RT-PCR. With a negative test result, users will be further asked about epidemiological factors and previous contacts. The user’s risk of COVID-19 infection and recommended next steps are determined based on symptoms, test results, epidemiological factors, and previous contacts. For the asymptomatic group, the questioning will fo-cus on the suspicion of contracting the disease through inquiries other than symptoms.
Vaccination: Aiming to achieve nationwide community immunity, the policy focuses on implementing a vac-cination strategy that provides free vaccines to the public. The aim of the country is to vaccinate the entire eligible population. Given this situation, there is a high demand for a comprehensive understanding of vac-cines, and it is crucial to provide people with accurate information. The vaccine-related topics include vac-cine characteristics, target groups, vaccines related to specific groups, side effects and complications, pre-and post-vaccination precautions, and common mis-conceptions about vaccines.
Treatment: Given the widespread misuse of COVID-19 treatment medications, the dissemination of unverified information in online communities and the disregard for the guidance of medical professionals, the Ministry of Health has promptly issued official documents providing regularly updated instructions on how to treat infectious patients at home. The data on home -based drug treatment included medication packages supplied by the Ministry of Health, indications and contraindications for different patient groups, and common mistakes encountered when using medication.
Health promotion: The pandemic has significantly im-pacted the health and lives of many people. It is un-derstandable that there is increased concern and atten-tion towards one's health. Therefore, health promotion was chosen as one of the main modules, addressing the public's questions regarding the use of traditional medicine methods, exercises (breathing exercises), and psychological issues.
Medical equipment: Data on equipment is essential, es-pecially when most infectious cases are self-monitored at home. People need to know how to use healthcare equipment at home. Sets of questions and answers were developed focusing on the correct usage and common mistakes when using devices such as SpO2 monitors, blood pressure monitors, thermometers, and oxygen tanks.
Post-COVID issues: Following the decline of the COVID-19 pandemic in Ho Chi Minh City and the whole country, people's attention has shifted from pre-ventive measures, diagnosis, and COVID-19 testing to focusing on methods to enhance overall health and addressing post-COVID-19 issues. Chatbot has also adapted to be more aligned with these new circum-stances, with a particular emphasis on post-COVID issues – a topic of great current interest. Post-COVID is a new issue, and there are many aspects that peo-ple are not yet aware of or do not understand clearly. Thus, for this module, the Q&A dataset primarily comprises basic information about post-COVID health concerns, including definitions, symptoms, methods to improve post-COVID health (lung func-tion, mobility, and endurance), and the timing and location for post-COVID consultations.
Processing and querying data
The user's messages were processed through the fol-lowing steps: (i) preprocessing the data prior to in-putting it into the classification model, (ii) storing the data received from the system's models into the database, and (iii) retrieving the data after receiving the key label for the question from the models and scripts.
Model training
Data were divided into two parts, with the training part comprising 85% of the content and the testing part encompassing the remaining 15%. We utilized the Support Vector Machine (SVM), Named Entity Recognition (ER) and K – nearest neighbor (KNN) to train the model.
System design and implementation
The Chatbot interface is deployed on a web platform and consists of two components. The first is a web-based user interface (UI); the second is a backend that handles the conversation using the Botkit frame-work. The UI and the backend will communicate with each other through the WebSocket protocol. Upon receiving a message from users, the Chatbot will transmit it to the backend via the Application Programming Interface (API) and follow the execu-tion flow.
Ethical Considerations in the Study
This research adheres to ethical principles in medical research and computer software development. The information in the Chatbot COVID-19 software has been verified and ensured for accuracy by the expert panel. The Chatbot COVID-19 software respects and protects the data of individuals/organizations. It does not infringe on the privacy rights of others, and it does not collect user personal information. This proposal is written by the research team and has not been published in any other work. This proposal has been read, edited, and approved by all members of the team and the supervising instructors. Our study does not collect personal information from survey participants, and participants face no physical or mental health risks. The study was approved by the Ethics Committee of the University of Medicine and Pharmacy at Ho Chi Minh City.
RESULTS
A database comprising 12 modules with over 1000 sets of questions and answers, addressing most of the issues related to COVID-19 that are of interest to the public. The number of pairs of questions and answers for each module is shown in Figure 2.
The Chatbot dataset has been reviewed and con-firmed by the Training Center for Family Physicians of the University of Medicine and Pharmacy at Ho Chi Minh City. It has been determined to be scientifi-cally accurate and appropriate for community edu-cation to assist the public during the COVID-19 pan-demic.
Applying machine learning and deep learning tech-niques allows Chatbot to effectively understand the users’ intentions and information. The development and refinement of the Chatbot system on the website platform https://covid19chatbot.com has been com-pleted. The system has been tested on 245 medical students from the Faculty of Medicine, University of Medicine and Pharmacy at Ho Chi Minh City, yield-ing promising results (Figure 3, Figure 4, Figure 5).
DISCUSSION
Since the COVID-19 pandemic, artificial intelligence (AI) technology in healthcare has gained increasing attention and application. The literature on using chatbots in the health sector has been steadily in-creasing [1]. Recent research emerging since the be-ginning of the pandemic in 2020 indicates that chat-bots play a distinctive role in the aftermath of the COVID-19 pandemic [9]. When crafting content for a chatbot in a critical emergency, emphasis should be placed on maintaining an iterative and agile ap-proach, leveraging complementary forces to triangu-late data, and striking a balance between global knowledge and a local focus [1]. Chatbot software development represents a corner-stone and a step forward in healthcare. The software will enable people to access accurate and timely in-formation about the COVID-19 pandemic while alle-viating the burden on the healthcare system during that pandemic time. This chatbot operates in the
Vietnamese language and engages in bidirectional communication with users via diverse mediums such as text, images, and audio, partly contributing to the successful controlled pandemic in Vietnam.
Several challenges and limitations have been identi-fied through the design and development process, both in terms of data construction and software devel-opment. Regarding data construction, the database constructed remains insufficient in size. As a result, Chatbot may not provide all the information that users need. This lack of compatibility may lead users to be-lieve that our Chatbot does not offer enough practical benefits, causing them to turn to other search engines. Additionally, ensuring the reliability and accuracy of data is still a challenge. While information is used from reputable and official sources like WHO, CDC, and HCDC, the database is quite extensive with vari-ous modules. This can lead to minor errors during ed-iting and data entry. Therefore, Chatbot requires a professional team to review the information regularly. Another challenge is the issue of maintaining regular updates of information. Due to the complex and rapid-ly changing pandemic situation worldwide and in Vi-etnam (e.g., the emergence of new virus variants, new vaccines, and new policies from the government and the Ministry of Health), verifying and updating the latest information frequently and continuously regard-ing the pandemic has been challenging. Besides, there is some inconsistency in information about the pan-demic between Vietnam and the rest of the world, as well as between different regions within Vietnam. This includes differences in regulations and policies for pandemic prevention, such as isolation and vaccina-tion policies. Finally, the language used in the current platform remains limited. The Chatbot, for instance, still does not provide enough data on the languages used in different regions (local dialects, ethnic lan-guages, internet slang, and so forth).
In terms of software development, there is a limitation in question recognition capability. As the Chatbot is programmed to respond to simple questions that are already present in the compiled dataset, it may not be able to recognize complex, lengthy, multi-faceted questions, or questions not included in the dataset. This is an area that needs improvement and requires a team with greater expertise in software development. The second challenge lies in the inadequate availabil-ity of high-level expertise required for software devel-opment, upgrades, maintenance, and error correction during operation.
Lastly, the accessibility and promotion of Chatbot still have many limitations. Our users can only ac-cess Chatbot through the provided website, and most people are uninformed about it. Thus, it is es-sential to integrate the COVID-19 Chatbot into popu-lar platforms such as social networks and mobile applications. This will increase its visibility and pro-vide more convenient conditions for users.
Financial support and additional extensive research and development for the Chatbot are necessary to address these challenges. This includes collaborating with a team of experts in higher-level software de-velopment tasks such as building, maintaining, de-bugging, and continuously improving the software. The Chatbot also needs financial support to facilitate its integration with popular social networks and messaging apps such as Messenger and Facebook, as well as to support promotional campaigns and com-munication initiatives to make it more accessible to users.
CONCLUSIONS
Given the unpredictable complexity of the pandem-ic, there is a great need for advanced and innovative approaches to managing the crisis. We have devel-oped a chatbot on the topic of COVID-19 to assist users in accessing authoritative information about health and the pandemic, countering misinfor-mation, evaluating individual risks, tracking expo-sure to SARS-CoV-2 and health symptoms, and dis-seminating information on disease prevention measures as well as relevant state regulations.
Although our preliminary survey shows that most users are satisfied with this Chatbot, there are limita-tions and challenges as the development process is still in its initial stages. Therefore, multidisciplinary collaboration and support in terms of funding and workforce are needed to develop Chatbot further and make it more refined.
ACKNOWLEDGEMENTS
We acknowledge Ho Chi Minh City University of Technology (HCMUT), VNU-HCM and the University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City, Vietnam for supporting this study.
FUNDING
University of Medicine and Pharmacy at Ho Chi Minh City, Ho Chi Minh City, Vietnam.
CONFLICT OF INTERESTS
Non.
REFERENCES
FIGURES
REFERENCES
ARTICLE INFO DOI: 10.12699/jfvpulm.15.46.2024.12
Conflict of Interest
Non
Date of manuscript receiving
05/12/2023
Date of publication after correction
27/02/2024
Article citation
Vinh Nguyen-Nhu, Tho Thanh Quan, Thien Duc Duong, Son Trung Huynh, Ngan Thi Kim Le, Duy Phung Quoc Vo, Liem Thanh Nguyen, Bao Pham Gia Nguyen, Hue Thi Hong Nguyen, Phuc Hoang Vo, Tung Ngoc Nguyen, Chinh Quang Doan, Minh Tran Cong Bui, Sy Duong-Quy. Harnessing Technology for Pandemic Support: Develop-ing a COVID-19 Chatbot in Vietnam. J Func Vent Pulm 2024;46(15):12-18