I just tried to do a search on “green tea” here, and got this:
The search term you specified (tea) is under the minimum word length (4) and therefore will not be found. Please make this term longer.
If this term contains a wildcard, please make this term more specific.
Huh? Can you change the minumum to 3 letters? We need 3-letter word searches in case someone wants to do a search on gasp “God” or “law” or something silly like that.
just my 2 cents (Canadian 2 cents, so maybe that’s 1.3 cents)
Sorry. Here’s what the documentation for the board software has to say:
Indexing three-letter words would make the database very large I imagine. For example, the software would index the word “the”. I doubt we will change the settings to index three-letter words.
I don’t know anything about how this stuff works, but would there be any way to exclude sertain 3 letter words, (ex. the, him, her, etc) and still keep some. I think that it is not great to not be able to search for God, or law, or whatever.
heraldgwena - I don’t know what facilities the software offers as far as indexing customization, we will have to investigate that. But if you would have to list all the three-letter words you don’t want to index, I imagine that would be a pretty huge list and I personally wouldn’t want to bother. If it’s possible to list only a few three-letter words to index, maybe that could be done.
zev_steinhardt - I feel your pain. Perhaps you could go by Zeverino instead?
Would it not be easier to search for a different word to find the info you want? If you searched for “God” or “law”, you’d end up with a massive number of hits, which you’d have to narrow down by modifying the search anyway.
Just a thought (I personally don’t care one way or another), but perhaps you can index all the three-letter and two-letter words and then simply nix all the words that have an unusually high number of hits. Say apply some threshhold that any words more common than, say, “God” are wiped from the index, thereby saving you the trouble of actually creating a list?
I have no idea whether the software allows for this, but it’s an idea…
So why was it OK for our previous board search engine to have three-letter word capability, but not this version of the board?
Do I assume it is merely the whim of the board software? It could obviously be changed. At least, I think. Was it done deliberately? Or are we just accepting the “off the rack” version, no questions asked?
samclem - I assume jdavis wanted to reduce the size of the index. It’s very simple to change the minimum word length being indexed (but very time-consuming to recreate the index.) To tell you the truth, I never searched for a three-letter word that I can remember with the old board so I’ll have to take your word for it that three-letter words were previously indexed. We can ask jdavis why that value was chosen.
Is it really that much of a hardship to not have those words indexed? I don’t feel that affected by it. I usually remember at least one poster that appeared in a thread and do a search by user name.
That value was chosen because that is what the software upgrade defaulted to. The other Administrators can choose different values if they wish. Seeing how the vBulletin search index is already 1.4 GBs in size and it took me 12 days to create it I’m not in much of a mood to change the minimum word length.
The search index is already larger than the total RAM of the server. That does not make for fast searches in the first place. Do we want even slower searches and all the whines and moans of how much slower the board becomes as it is impacted by those even slower searches? I certainly don’t.
I came across this the other day; I was looking for a thread about USB; you can use wildcards, so USB* (which is four characters) is permitted, but if three-letter words aren’t indexed, then it isn’t going to help.
Jerry Thank you for the reply. You chose the default. And a sensible choice it was, as the server is not up to a more sophisticated application.
DDG and I have overnighted a check to the SDMB for $20,000 to purchase a new server. Perhaps you have read about it in one of her threads. This should put some joy back into your life.*
One place this is gonna pinch, is with searches for discussions of US corporations and government agencies, many of which have 3-letter acronyms (CBS, FCC).
Not saying what should or shouldn’t be done, just noting that the inability to do searches on 3-letter terms has the potential to be a non-trivial problem.