Okay, I understand, there’s no point in clogging up the search engine looking for “the” and “and”. But honestly, sometimes you need a three-letter word to find what you’re looking for.
A few months ago, I was trying to find a thread called “Little Kid Logic”. But guess what? Without the kid part, “little logic” brings up about half the threads that were ever posted. Needless to say, I did not find what I was looking for.
Or suppose someone was looking for the epic Ron Thread, and didn’t remember that it was started by melodyharmonius. The only thing they could search for is “Ron”: again, three letters.
Some other three-letter words that could prove vital to one’s search:
Eye
Sex
Leg
Zoo
TMI
Gnu
Spy
Sky
Kit
Shy
Red
Ewe
Lie
Pie
Men
Wax
Cow
Cat
Arm
Pro
Jew
Sew
Eve
Dog
Could we please, please get rid of the three-letter limit?
I’m not sure if vBulletin allows you to specify a list of words shorter than the limit to index. If it’s a feature in vBulletin then I can see this being useful. If it’s not already in vBulletin, then I don’t see that happening.
If you haven’t found your thread yet, Malleus, I’m pretty sure you were looking for this one. Refining the search to “thread titles only” will filter your results down quite a bit.
Other search terms would include OSX, Wii, and XP.
Really, it’d be best if the search engine didn’t have the length cutoff at all. Sure, you don’t want “the” and “and” indexed, but then, you also don’t want “them” or “thing” indexed, either. You handle those not by putting in a length cutoff, but by putting them on the badwords index.
Of course, at this point, it’d never get implemented, because to do so would require re-indexing the entire board, and that would probably require that the board be shut down for several days, which nobody wants.
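To make the difference concrete, here is a minimal sketch of the two filtering approaches. It is not how vBulletin actually builds its index; the function names and the tiny stopword list are made up for illustration, and a real badwords list would be much longer.

```python
import re

# Hypothetical stopword ("badwords") list, for illustration only.
STOPWORDS = {"the", "and", "them", "thing", "you", "for", "was"}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def index_terms_min_length(text, min_length=4):
    """Keep only words with at least min_length characters."""
    return [w for w in tokenize(text) if len(w) >= min_length]


def index_terms_stopwords(text, stopwords=STOPWORDS):
    """Keep every word that is not on the stopword list, whatever its length."""
    return [w for w in tokenize(text) if w not in stopwords]


title = "Little Kid Logic and the Ron Thread"
print(index_terms_min_length(title))  # ['little', 'logic', 'thread']
print(index_terms_stopwords(title))   # ['little', 'kid', 'logic', 'ron', 'thread']
```

With the length minimum, “kid” and “Ron” never make it into the index; with the stopword list, they do, while “and” and “the” are still dropped.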
You may get some archive results (which will still lead you to the FULL VERSION), but try Google, putting site:boards.straightdope.com before your search terms. In this case I had to put “little kid logic” in quotes.
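If you want to build that kind of query without typing it out each time, something like the following should work. It is only a sketch: the helper name `site_search_url` is made up, and it assumes Google’s usual `search?q=` URL form.

```python
from urllib.parse import urlencode


def site_search_url(phrase, site="boards.straightdope.com", exact=True):
    """Build a Google search URL restricted to one site.

    When exact is True the phrase is wrapped in double quotes so that
    it is matched as a whole phrase.
    """
    term = f'"{phrase}"' if exact else phrase
    query = f"site:{site} {term}"
    return "https://www.google.com/search?" + urlencode({"q": query})


print(site_search_url("little kid logic"))
```

Passing `exact=False` drops the quotes, which is the version that returned too many results in this case.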
It’s not a cutoff, it’s a minimum, which is currently set to 4 letters.
“When using the vBulletin default search, this option limits the size of indexed words. The smaller this number is, the larger your search index, and conversely your database is going to be.”
I can see that if words like “the” and “you” were indexed, the size of the index would go up dramatically. In terms of the physical layout, that really means the database itself, since it is likely that the indexes to the files are at least as large as the actual content. On the other hand, the number of three-letter words that can exist at all is theoretically limited to 26**3, or 17,576. It’s more limited if we exclude the sequences that are not allowed in English, and still more if we disallow grammatical function words such as articles and prepositions. Has anyone looked into what the indexing cost would be under those conditions?
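For a sense of scale, here is a rough sketch of that arithmetic, plus one way the distinct three-letter words in a set of posts could be counted. The sample posts and the stopword list are invented for illustration; this says nothing about the real board’s numbers.

```python
import re
import string

# Upper bound on distinct three-letter strings over a 26-letter alphabet.
upper_bound = len(string.ascii_lowercase) ** 3
print(upper_bound)  # 17576


def distinct_three_letter_words(texts, stopwords=frozenset()):
    """Collect the distinct three-letter words in texts, skipping stopwords."""
    found = set()
    for text in texts:
        for word in re.findall(r"[a-z]+", text.lower()):
            if len(word) == 3 and word not in stopwords:
                found.add(word)
    return found


# Invented sample posts and function-word list.
posts = ["The cat sat on the red mat", "You and the dog saw a gnu"]
function_words = frozenset({"the", "and", "you"})

words = distinct_three_letter_words(posts, function_words)
print(sorted(words))  # ['cat', 'dog', 'gnu', 'mat', 'red', 'sat', 'saw']
```

The count of distinct words only bounds the number of new index keys. The real cost depends on how often each word occurs, since every occurrence adds an entry pointing back to a post.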
I don’t know how long it would take to re-index the system, but I have been told it would mean taking the system offline for at least a while so that it could happen: you couldn’t post and couldn’t read. Might take a while. A long while.
Unless there’s a dire reason to do so, it’s best to let the system run so you can see it.
This thread reminds me of an impressive book in my university library - a complete Bible concordance (in English - Douay-Rheims Bible) that was at least 100 years old. I looked up “the”, and sure enough, the author had listed all the verses in the Bible that contained the word “the”. I’m pretty sure the poor guy didn’t have a computer to help him build his concordance!
Like when it becomes infected and bursts and you’re rushed to the hospital where a swarthy masked man uses a vacuum cleaner on your insides and – hopefully – spares you a painful death?