I am doing a machine learning project that requires a large amount of text documents written by people over the internet. One of my first thoughts was to use sdmb posts. However, I read the TOS and it says:
So clearly I cannot mine posts willy-nilly. Would I be allowed to use posts if the user explicitly permitted me to? I suspect the answer is no, since the TOS and disclaimer imply that Creative Loafing own the content, not the user. And if the answer is no, then I fully understand and I won’t undergo anything of this nature on these boards.
Asking each individual separately seems like a lot of work. Why don’t you do a search for either public domain or (if it must be written specifically for the web) blogs that use a Creative Commons license instead?
No, my posts are my posts and I can do anything with them that I wish. By posting them here, I give the board a NONexclusive and irrevocable right to use my posts as well.
You could also apply to TBTB for a grant, probably via PM. The grant would be in-kind. Seriously, we’re here to fight ignorance and if you have a research proposal, you may be able to get permission.
You have full and complete permission to use and/or republish my posts for any purpose not otherwise prohibited by law. 38,000 should be enough to be getting on with, right?
Swords to Plowshares, you hereby have my permission to use any of my SDMB posts, except those in which I sound stupid… which may severely limit your choices.