|
|
|
#1
|
|||
|
|||
|
Why can't I search for the word "alone"?
I tried to search for the words movies and alone, and the search function simply retreived all threads containing the word movies. Then I tried a search for just the word alone, and got no results. I was searching entire messages, not just titles, and I changed the date restriction to "any date."
It's impossible that there are no posts containing the word "alone." Is "alone" some kind of special reserved search word? |
| Advertisements | |
|
|
|
|
#2
|
|||
|
|||
|
Did you search & see if your post you just wrote shows up?
The search engine is pretty freaky sometimes. (Also, no can search for 3 letter words, this is because it would make the database too big.) |
|
#3
|
|||
|
|||
|
Did you search & see if your post you just wrote shows up?
The search engine is pretty freaky sometimes. (Also, no can search for 3 letter words, this is because it would make the database too big.) |
|
#4
|
|||
|
|||
|
Interesting..alone doesn't show up. Not even when just searching this forum.
|
|
#5
|
|||
|
|||
|
As handy said, the search engine's freaky, and having one of those spells with this. I put in a search for other five-letter words that have appeared in this thread: "tried" and "forum". Only the latter showed up in the search. Quite odd.
|
|
#6
|
|||
|
|||
|
Bizarre. I just tried searching for "anyway" and "always". Nothing.
One unusual thing I noticed is that when I search for these words that should be generating results, I don't get that intermediary "Your search is in progress" screen. I get taken right from the screen with the form on it to the "Sorry, no matches" message. |
|
#7
|
|||
|
|||
|
No idea why this happens, however you can search for the word "alone" if you stick an asterisk at either end:
*alone* |
|
#8
|
|||
|
|||
|
Is it possible that there are so many search matches that nothing comes up? That would make sense in regards to the "anyway" and "always" searches, too.
|
|
#9
|
|||
|
|||
|
Quote:
|
|
#10
|
|||
|
|||
|
Quote:
Usually, anyway, I search for word-terms that aren't all that common. Usually proper nouns. |
|
#11
|
|||
|
|||
|
At http://www.google.com/
You can search for: "straightdope alone" (no quotes) & get some results. I dont know if they index the forum often. |
|
#12
|
|||
|
|||
|
Quote:
Maybe posts take a little bit to get added to the search index? That wouldn't explain the original problem of not finding "alone" in a search, but it might explain why some of the other searches for words used in this thread didn't turn up. |
|
#13
|
|||
|
|||
|
No, any post that is made is either added to the Search Index at post time, or else it is not added at all. The only exception is if the Board is having serious problems, in which case it is possible (but unlikely) that only part of the post would be parsed into the search index. It typically is an all-or-nothing affair.
Many words are excluded from the Search index by default. These words are listed in a file called "badwords.php". An example of some of the words in it are: PHP Code:
Note that "alone" is in the list. Now, an Admin with server access can set the flag to be "0", which inicates that a word is not flagged as bad, or they can delete the entry, or add new ones. After discovering that the words "forum", "post", "board", etc. were taking up a disproportionate amount of space in the index, I added them to my index. |
|
#14
|
|||
|
|||
|
Since "alone" isn't indexed, what happens when you search for "alone*" like cazzle suggested? The software doesn't start a full text search of all posts in the selected ranges, does it? Or does it?
|
|
#15
|
|||
|
|||
|
Quote:
|
|
#16
|
|||
|
|||
|
How did you get the badwords.php index?
|
|
#17
|
|||
|
|||
|
bluecanary, since Anthracite hasn't been back to your question yet, I'll attempt a speculation.
Anthracite has her own message board that runs the same software as the SDMB. As a consequence, she has goddess-like powers to access the information she shared. She noted that these were the default settings for the software. Hope that helps. |
|
#18
|
|||
|
|||
|
Quote:
The "badwords.php" doesn't have any sensitive info in it, nor is it Board code - it's just a honkin' big list of words. So I felt safe in posting it. I honestly don't have an answer to the asterisks question - I was involved in Moderator selection issues this weekend. I will ask on the vBulletin developers site to see if I can get a better answer. |
|
#19
|
|||
|
|||
|
As Anthracite said - the word alone is not indexed because of its presence in the file of words not to be indexed.
Why does alone* work? Because it finds threads where you have the word alone followed by any number of non-blank characters. So "alone" by itself would not be indexed, but "aloneness" would be. |
|
#20
|
|||
|
|||
|
I see now what was being asked. I can't get the search string "*alone*" to return a thread which *only* has alone appearing as "alone". If anyone else can, then it would need investigation. Otherwise, I think everything is behaving as I said it was.
|
|
#21
|
|||
|
|||
|
A broad range of noise words is good, if it keeps the index size down. What's nasty is that vBulletin's search function just drops them from your query without notice.
![]() ("ain't"? Don't apostrophes get removed anyway and are treated as a word delimiter?) |
|
#22
|
|||
|
|||
|
FYI, with Tuba's permission, I am posting a link to the default list of ignored words supplied with Vbulletin's version 2.2.1. I have extracted this from the php code file badwords.php. Of course, there is no guarantee that the SDMB list is the same, but it's a good starting point.
badwords.txt Note that "ain't" and other contracted words are in this list. |
|
#23
|
|||
|
|||
|
Those are bad words? I always thought that "darn" and "shucks" were bad words. Was my mother wrong?
|
|
#24
|
|||
|
|||
|
Quote:
|
|
#25
|
|||
|
|||
|
wolfstu - the button shows up if you have a homepage listed in your profile. Since it's dynamic, it doesn't show up if you remove the homepage listed in your profile.
|
|
#26
|
|||
|
|||
|
Yeah, weird. Anthracite used to. I'm sure she had a plenty-good reason for removing it.
|
|
#27
|
|||
|
|||
|
Quote:
Quote:
Quote:
:: Rants in a corner :: |
|
#28
|
|||
|
|||
|
So when a search with wildcards is executed, is the board software performing a different type of search routine (maybe a brute-force search instead of an index search)? - certainly the wildcard search finds words that (unless the SDMB 'badwords' has been modified) aren't indexed.
Not that I really need to know, but I'm guessing that if it does use a brute force search for wildcards, then the server hit is going to be much more severe, isn't it? |
|
#29
|
|||
|
|||
|
Quote:
|
|
#30
|
|||
|
|||
|
The "*" wildcard can only be used at the end of a word, indicating that the index is still being used in wildcard searches.
|
|
#31
|
|||
|
|||
|
Hmm. I searched for "alon*" in ATMB for one month and found this thread, highlighting all the occurrences of "alone". As expected, "aloneness" in Arnold's post was also highlighted. Also as expected, "alongside" in a different thread was found, too.
|
|
#32
|
|||
|
|||
|
Quote:
(<- couldn't resist) From our very own search page (bolding mine):Quote:
IIRC the search index of this board is(was) about 80 MBytes, so I guess it's better to avoid scanning through that with a leading wildcard when the server is slow. Last edited by C K Dexter Haven; 01-15-2003 at 10:40 AM. |
|
#33
|
|||
|
|||
|
rowrrbazzle - I believe the explanation for what you see is this:
when you search for "alon*" you will find threads containing a word starting with "alon" but not the word "alone" since "alone" is in the "bad words" list. However, when the thread is called up, the "highligh=alon*" part of the link will highlight every word starting with "alon" including the word "alone". |
|
#34
|
|||
|
|||
|
wolfstu go away, that's my corner.
[continued rant]What's really nasty is that they're carried over to the highlight parameter as if nothing happened, pretending that they're the reason for the search hit.[/continued rant]Ah, Dubious got its second colon (like it better too). If a kind mod has the heart to fix mine up there? |
|
#35
|
|||
|
|||
|
So fixed, fem.
|
![]() |
| Bookmarks |
| Thread Tools | |
| Display Modes | |
|
|