All the news messages in the 20 newsgroups test corpus were created during 1993 and were openly accessible in Usenet a part of the internet. The 20 newsgroups corpus itsself has been publicly available as an downloadable archive since at least 1997 (from Tom Mitchel's site accompaning his well-known book "Machine Learning").
Nevertheless being confused about the subleties of national and global data privacy protection rules I decided to protect the message text body by an Quiz. Through this I want to make sure, that the data is used exclusively for educational purposes.