dc.contributor.advisor | Nguyễn, Quốc Trung | |
dc.contributor.advisor | Trương, Hoàng Vinh | |
dc.contributor.author | Phạm, Minh Tuấn | |
dc.contributor.author | Nguyễn, Quốc Hùng | |
dc.contributor.author | Lê, Phước Gia Thịnh | |
dc.date.accessioned | 2024-06-19T07:46:31Z | |
dc.date.available | 2024-06-19T07:46:31Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | http://ds.libol.fpt.edu.vn/handle/123456789/4104 | |
dc.description.abstract | This project introduces a robust bilingual question-answering and natural Text-to-Speech (TTS) system, primarily designed to support both Vietnamese and English languages. Leveraging pre-trained Large Language Models (LLMs), the system enables seamless interactions in both text and speech domains. Key components include a Retrieval Augmented Generation Pipeline (RAG) for efficient information retrieval, an LLM module for response generation and an integration of a TTS module for human-like speech synthesis, and the development of a user-friendly Demo Web Application. The proposed solutions encompass various stages, including data collection, RAG framework development, fine-tuning of LLMs, and rigorous TTS evaluation using metrics and user surveys. Notable achievements of the project include the development of "T-LLama" - a 7-billion-parameter Bilingual QA LLM, ranked among the top 5 on the VMLU Leaderboard, the implementation of a Bilingual RAG system, the creation of "viXTTS" - the first LLM-based Vietnamese TTS model with voice cloning and multilingual capability through transfer learning, and the assembly of "viVoice" - a gigantic Vietnamese speech dataset with over 1000 hours of audio. | en_US |
dc.language.iso | en | en_US |
dc.publisher | FPTU HCM | en_US |
dc.subject | Đồ án tốt nghiệp | en_US |
dc.subject | Capstone Project | en_US |
dc.subject | Trí tuệ nhân tạo | en_US |
dc.subject | Artificial Intelligence | en_US |
dc.subject | Bilingual | en_US |
dc.subject | Hybrid | en_US |
dc.subject | Text to Speech | en_US |
dc.subject | SP24AI08 | en_US |
dc.subject | Hệ thống hỏi đáp | en_US |
dc.subject | Giọng nói tự nhiên | en_US |
dc.title | Hybrid Bilingual Question-Answering System with Human-Level Naturalness Text-to-Speech Extension | en_US |
dc.title.alternative | Hệ Thống Hỏi Đáp Đa Ngôn Ngữ với Giọng Nói Tự Nhiên Như Con Người | en_US |
dc.type | Thesis | en_US |
Bộ sưu tập thuộc về Trung tâm Thông tin - Thư viện - Trường Đại học FPT
Địa chỉ: Phòng 207 - Tầng 1 - Km 28 - Khu công nghệ cao Hòa Lạc - Thạch Hòa - Thạch Thất - Hà Nội
Điện thoại: 844.66805912 - FAX: - Email: thuvien_fu_hoalac@fpt.edu.vn