no code implementations • 17 Apr 2022 • Benjamin D. Horne
In order to better support researchers, journalist, and practitioners in their use of the MeLa-BitChute dataset for exploration and investigative reporting, we provide new psycho-linguistic metadata for the videos, comments, and channels in the dataset using LIWC22.
1 code implementation • 10 Mar 2022 • Maurício Gruppi, Benjamin D. Horne, Sibel Adali
In this paper, we present the fifth installment of the NELA-GT datasets, NELA-GT-2022.
no code implementations • 10 Feb 2022 • Milo Trujillo, Maurício Gruppi, Cody Buntain, Benjamin D. Horne
In this paper we present a near-complete dataset of over 3M videos from 61K channels over 2. 5 years (June 2019 to December 2021) from the social video hosting platform BitChute, a commonly used alternative to YouTube.
no code implementations • 15 Jan 2021 • Maurício Gruppi, Benjamin D. Horne, Sibel Adali
Moreover, we show that the addition of CSN features increases the accuracy of writing style models, boosting accuracy as much as 14\% when using Random Forests.
no code implementations • 26 May 2020 • Benjamin D. Horne, Maurício Gruppi, Sibel Adali
A major concern with text-based news veracity detection methods is that they may not generalize across countries and cultures.
2 code implementations • 18 Mar 2020 • Maurício Gruppi, Benjamin D. Horne, Sibel Adali
In this paper, we present an updated version of the NELA-GT-2018 dataset (N{\o}rregaard, Horne, and Adal{\i} 2019), entitled NELA-GT-2019.
Computers and Society
no code implementations • 2 Apr 2019 • Jeppe Norregaard, Benjamin D. Horne, Sibel Adali
In this paper, we present a dataset of 713k articles collected between 02/2018-11/2018.
Computers and Society
no code implementations • 27 Aug 2018 • Benjamin D. Horne, William Dron, Sibel Adali
To answer these questions, we compute well-studied content-based features on over 60K news articles from 4 communities on reddit. com.
no code implementations • 7 Jun 2018 • Mauricio Gruppi, Benjamin D. Horne, Sibel Adali
Our results show that features related to the writing style were prominent in both data sets and, despite the language difference, some features have a universal behavior, being significant to both US and Brazilian news articles.
1 code implementation • 27 Mar 2018 • Benjamin D. Horne, William Dron, Sara Khedr, Sibel Adali
In this paper, we discuss the first release of the data set and demonstrate 4 use cases of the data and features: news characterization, engagement characterization, news attribution and content copying, and discovering news narratives.
Computers and Society
2 code implementations • 28 Mar 2017 • Benjamin D. Horne, Sibel Adali
The problem of fake news has gained a lot of attention as it is claimed to have had a significant impact on 2016 US Presidential Elections.