YouTube AV 50K: An Annotated Corpus for Comments in Autonomous Vehicles

Authors Kaiming Fu, Lei Lin, Tao Li, Minsoo Choi, Siyuan Gong, Jian Wang
Journal/Conference Name 2018 International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)
Paper Abstract With one billion monthly viewers and millions of users discussing and sharing opinions, comments below YouTube videos are a rich source of data for opinion mining and sentiment analysis. We introduce the YouTube AV 50K dataset, a freely available collection of more than 50,000 YouTube comments and metadata from below autonomous vehicle (AV)-related videos. We describe its creation process, content, and data format, and discuss its possible uses. In particular, we present a case study of the first self-driving car fatality to evaluate the dataset, and show how the dataset can be used to better understand public attitudes toward self-driving cars and public reactions to the accident. Future developments of the dataset are also discussed.
Date of publication 2018
Code Programming Language Python
