Ensemble summarization models to leverage performance on CoronaNet

dc.contributor.committeeChair: Sheng, Victor
dc.contributor.committeeMember: Chen, Lin
dc.contributor.committeeMember: Chi, Sabrina
dc.creator: Zhou, Fei
dc.date.accessioned: 2021-08-04T19:49:08Z
dc.date.available: 2021-08-04T19:49:08Z
dc.date.created: 2021-05
dc.date.issued: 2021-05
dc.date.submitted: May 2021
dc.date.updated: 2021-08-04T19:49:09Z
dc.description.abstract: The COVID-19 pandemic is the fastest-spreading and most devastating event in recent history. The CoronaNet Research Project produced a COVID-19 dataset on how governments responded to the pandemic. The dataset provides a hand-written summary and source URL links for each recorded case. The text behind the links generally consists of long articles, and we apply NLP summarization in this context. There are two approaches to summarization. Extractive methods, which select text from the source to form summaries, are usually simpler but less flexible. Abstractive techniques are usually more complicated than extractive methods but more flexible in semantics. With the advances in deep learning, NLP summarization has seen many successes and applications in daily life. In this thesis, we built a system that ensembles pre-trained NLP models to improve abstractive summarization performance. We incorporated a focal loss function to improve performance by focusing on the samples with lower scores. Our proposed ensemble method improved the overall ROUGE scores compared to the individual models.
dc.format.mimetype: application/pdf
dc.identifier.uri: https://hdl.handle.net/2346/87597
dc.language.iso: eng
dc.rights.availability: Access is not restricted.
dc.subject: NLP
dc.subject: Text Summarization
dc.subject: Deep Learning
dc.title: Ensemble summarization models to leverage performance on CoronaNet
dc.type: Thesis
dc.type.material: text
thesis.degree.department: Computer Science
thesis.degree.discipline: Computer Science
thesis.degree.grantor: Texas Tech University
thesis.degree.level: Masters
thesis.degree.name: Master of Science

Files

Original bundle

Name: ZHOU-THESIS-2021.pdf
Size: 401.29 KB
Format: Adobe Portable Document Format

License bundle

Name: LICENSE.txt
Size: 1.84 KB
Format: Plain Text