YouTube AV 50K: An Annotated Corpus for Comments in Autonomous Vehicles
View Researcher's Other CodesDisclaimer: The provided code links for this paper are external links. Science Nest has no responsibility for the accuracy, legality or content of these links. Also, by downloading this code(s), you agree to comply with the terms of use as set out by the author(s) of the code(s).
Please contact us in case of a broken link from here
Authors | Kaiming Fu, Lei Lin, Tao Li, Minsoo Choi, Siyuan Gong, Jian Wang |
Journal/Conference Name | 2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP) |
Paper Category | Artificial Intelligence |
Paper Abstract | With one billion monthly viewers, and millions of users discussing and sharing opinions, comments below YouTube videos are rich sources of data for opinion mining and sentiment analysis. We introduce the YouTube AV 50K dataset, a freely-available collections of more than 50,000 YouTube comments and metadata below autonomous vehicle (AV)-related videos. We describe its creation process, its content and data format, and discuss its possible usages. Especially, we do a case study of the first self-driving car fatality to evaluate the dataset, and show how we can use this dataset to better understand public attitudes toward self-driving cars and public reactions to the accident. Future developments of the dataset are also discussed. |
Date of publication | 2018 |
Code Programming Language | Python |
Comment |