Interesting article on machine learning and linguistics.
Earlier this year, we started constructing a data set that consists of all of the viewer comments on YouTube videos posted by four television networks – MSNBC, CNN, Fox News and One America News Network – that target slices of the political spectrum. Together, the data set contains over 85 million comments on over 200,000 videos from 6.5 million viewers since 2014.
Our machine learning translation system found that words with vastly different meanings, like “KKK” and “BLM,” were used in the exact same contexts depending on the YouTube channel being analyzed.
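To make "used in the exact same contexts" concrete, here is a minimal sketch. It is not the article's translation system, just a simple co-occurrence comparison. The helper names (`context_counts`, `cosine`), the window size, and the toy comments with placeholder words `alpha` and `beta` are all invented for illustration.

```python
from collections import Counter
from math import sqrt


def context_counts(comments, target, window=2):
    """Count the words that appear within `window` tokens of `target`."""
    counts = Counter()
    for comment in comments:
        tokens = comment.lower().split()
        for i, token in enumerate(tokens):
            if token != target:
                continue
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            # Neighbours on both sides, excluding the target itself.
            counts.update(tokens[lo:i] + tokens[i + 1:hi])
    return counts


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


# Invented toy comments standing in for two channels' comment sections.
channel_a = ["the alpha group is ruining this country", "alpha is a hate group"]
channel_b = ["the beta group is ruining this country", "beta is a hate group"]

similarity = cosine(
    context_counts(channel_a, "alpha"),
    context_counts(channel_b, "beta"),
)
print(f"context similarity: {similarity:.3f}")
```

A score near 1 means the two words are surrounded by the same neighbouring words in their respective comment sets, which is the kind of pattern the quoted passage describes. A real analysis would presumably rely on learned word embeddings trained on millions of comments rather than raw counts.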