The Straight Dope

  #1  
Old 06-28-2011, 05:27 PM
Stoid Stoid is offline
Charter Member
 
Join Date: Jun 1999
Location: City of Angels
Posts: 12,816
Is it possible to change the 4-letter search limit?

WAY too many meaningful words are three letters! Two most recent searches that failed because they were 3 letters: CSS and FOX.

Wah.
  #2  
Old 06-28-2011, 05:47 PM
TubaDiva TubaDiva is offline
Mother's Little Helper
Administrator
 
Join Date: Mar 1999
Location: In the land of OO-bla-dee
Posts: 9,412
Searches are about the most intensive thing the system does -- we've always had to balance the resources used for searches versus all the other needs of the site, including rendering pages and writing posts.

We settled on the limit that we did because that seems to be the best balance of needs and makes the most equitable use of resources. If we went down any further it would affect everything else the system does in a profoundly negative direction.

You might try using Google for your more intensive searches and bypassing the faulty SDMB search engine altogether. Please see this excellent posting on the subject here:

http://boards.straightdope.com/sdmb/...4#post11650034
  #3  
Old 06-29-2011, 07:03 AM
Peter Morris Peter Morris is offline
Charter Member
 
Join Date: Apr 2003
Location: ___\o/___(\___
Posts: 8,458
Has Google indexed the whole site yet? Last I heard, a Google search typically missed about half the posts compared to a similar board search.
  #4  
Old 06-29-2011, 08:59 AM
Vinyl Turnip Vinyl Turnip is online now
Charter Member
 
Join Date: Mar 2002
Location: <--- <--- <---
Posts: 12,730
It hasn't bothered me yet on this site, but this same issue has caused me great frustration when trying to search other fora, particularly technical/product support boards. In some cases it renders the search function practically useless, making it impossible to search for (most) file extension types and a great number of acronyms.
  #5  
Old 06-29-2011, 09:08 AM
Giles Giles is offline
Charter Member
 
Join Date: Apr 2004
Location: Newcastle NSW
Posts: 11,555
I gather that the problem is that the limitation has to be on number of letters, and not on a stop-word list. If you could search on three-letter words, you would require the system to search on very common words like "the" and "and", which are not terribly useful in searching. I suspect that the use of stop words would require major work on the software.
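A minimal sketch of the difference between the two filters described above, assuming a toy tokenizer. The stop-word list and function names are made up for illustration; this is not vBulletin's actual code.

```python
import re

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "and", "for", "but", "not", "you", "are", "was"}

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def index_terms_by_length(text, min_length=4):
    """Length filter: keep only words of at least min_length characters."""
    return [w for w in tokenize(text) if len(w) >= min_length]

def index_terms_by_stop_words(text, stop_words=STOP_WORDS):
    """Stop-word filter: keep every word that is not on the stop-word list."""
    return [w for w in tokenize(text) if w not in stop_words]

post = "The CSS for the FOX page was broken"
print(index_terms_by_length(post))      # ['page', 'broken']
print(index_terms_by_stop_words(post))  # ['css', 'fox', 'page', 'broken']
```

The length filter drops "css" and "fox" along with "the" and "for"; the stop-word filter keeps them, at the cost of maintaining a list.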
  #6  
Old 06-29-2011, 09:22 AM
TubaDiva TubaDiva is offline
Mother's Little Helper
Administrator
 
Join Date: Mar 1999
Location: In the land of OO-bla-dee
Posts: 9,412
vB says:

Quote:
Search Index Minimum Word Length

When using the vBulletin default search, this option limits the size of indexed words. The smaller this number is, the larger your search index, and conversely your database is going to be.
Including three letter words makes the database insanely large.
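To get a rough feel for why a lower minimum word length grows the index, here is a small sketch that builds a toy inverted index at two different limits. The sample posts and the "(word, post) pair" size measure are assumptions for illustration and say nothing about vBulletin's real table layout.

```python
import re
from collections import defaultdict

def build_index(posts, min_length):
    """Map each word of at least min_length characters to the ids of posts containing it."""
    index = defaultdict(set)
    for post_id, text in enumerate(posts):
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            if len(word) >= min_length:
                index[word].add(post_id)
    return index

def index_size(index):
    """Count (word, post) pairs, a rough proxy for rows in a search-index table."""
    return sum(len(post_ids) for post_ids in index.values())

# Made-up sample posts.
posts = [
    "The DVD and the CPU are both fine",
    "Fox and NBC ran the same story",
    "You can get the CSS from the server",
]

for min_length in (4, 3):
    print(min_length, index_size(build_index(posts, min_length)))
```

On real posts the ratio will differ, but short words are frequent enough that the jump from a four-letter to a three-letter limit tends to be large.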
  #7  
Old 06-29-2011, 01:53 PM
Munch Munch is online now
Guest
 
Join Date: Mar 2000
Quote:
Originally Posted by Peter Morris
Has Google indexed the whole site yet? Last I heard, a Google search typically missed about half the posts compared to a similar board search.
Where did you hear that? I can't imagine that being the case after about an hour of Google starting to index. A thread with 10 pages with the word "dominoes" on each one is going to come up with one hit on the SDMB, 10 hits on Google. In fact, googling "site:boards.straightdope.com dominoes" has 5390 hits, searching "dominoes" on the board comes up with 471 (bad example, since there's a thread in the Game Room with thousands of posts with "dominoes" in the title). "Congresswoman" gets 282 on the board, 2620 on Google.

Last edited by Munch; 06-29-2011 at 01:53 PM.
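For anyone who wants to script the Google comparison, a small sketch that builds the `site:` query URL used above. The function name is made up for illustration; it only constructs the URL and does not fetch results or hit counts.

```python
from urllib.parse import urlencode

def google_site_search_url(site, terms):
    """Build a Google search URL restricted to one site with the site: operator."""
    query = f"site:{site} {terms}"
    return "https://www.google.com/search?" + urlencode({"q": query})

print(google_site_search_url("boards.straightdope.com", "dominoes"))
```

Because the query goes to Google rather than the board, three-letter terms such as CSS or FOX are not subject to the board's minimum word length.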
  #8  
Old 06-29-2011, 06:41 PM
GuanoLad GuanoLad is offline
Charter Member
 
Join Date: Sep 1999
Location: Where the wild roses grow
Posts: 18,126
I thought there was an "exceptions" list in vBulletin, where you can add in some relevant three letter words it can search. I used to look after a board where some three letter acronyms were common (DVD for example), so included many of them as searchable.
  #9  
Old 06-29-2011, 10:24 PM
MsWhatsit MsWhatsit is offline
Member
 
Join Date: Jul 2000
Location: Columbus, OH
Posts: 11,400
Google seems to have indexed the SDMB pretty thoroughly at this point. I can't think of specific examples, but there have been a few times lately that I had a vague memory of a really old thread and did a Google search based on a few keywords I remembered from the thread, and Google had it.
  #10  
Old 06-29-2011, 10:26 PM
TubaDiva TubaDiva is offline
Mother's Little Helper
Administrator
 
Join Date: Mar 1999
Location: In the land of OO-bla-dee
Posts: 9,412
Yep, there is. "Words to be Included Despite Character Limit" What's your pleasure?
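A minimal sketch of how an inclusion list like this can sit on top of a length limit. The constant and function names are hypothetical, and the list is seeded only with words already mentioned in this thread; vBulletin's own implementation may differ.

```python
MIN_WORD_LENGTH = 4  # the limit discussed in this thread

# Hypothetical inclusion list, seeded with words mentioned earlier in the thread.
ALWAYS_INCLUDE = {"dvd", "css", "fox"}

def is_searchable(word, min_length=MIN_WORD_LENGTH, always_include=ALWAYS_INCLUDE):
    """A word is searchable if it is long enough or explicitly listed."""
    word = word.lower()
    return len(word) >= min_length or word in always_include

for word in ("DVD", "the", "search"):
    print(word, is_searchable(word))
```

Short words that are not listed, such as "the", stay out of the index, so the index grows only by the listed terms.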
  #11  
Old 06-29-2011, 10:28 PM
Guinastasia Guinastasia is offline
Squirrelly Wrath
 
Join Date: Jul 2000
Location: Pittsburgh, PA
Posts: 44,753
Actually most boards I visit have a four letter search limit. So it's not just the dope.
  #12  
Old 06-29-2011, 11:05 PM
Stoid Stoid is offline
Charter Member
 
Join Date: Jun 1999
Location: City of Angels
Posts: 12,816
Quote:
Originally Posted by TubaDiva
Yep, there is. "Words to be Included Despite Character Limit" What's your pleasure?

dvd
css
fox
nbc
cbs
abc
msn
cdc
cms
vip
mp3
cgi
ram
cpu
php
sql
lcd
jpg
gif
  #13  
Old 06-29-2011, 11:09 PM
MsWhatsit MsWhatsit is offline
Member
 
Join Date: Jul 2000
Location: Columbus, OH
Posts: 11,400
Mac.
  #14  
Old 06-29-2011, 11:44 PM
TubaDiva TubaDiva is offline
Mother's Little Helper
Administrator
 
Join Date: Mar 1999
Location: In the land of OO-bla-dee
Posts: 9,412
Okay, I've added these in. Let's see what happens.
  #15  
Old 06-29-2011, 11:54 PM
hajario hajario is offline
Charter Member
 
Join Date: Apr 2001
Location: Santa Barbara, California
Posts: 12,073
Very cool. Thanks for adding those.

Did you know that Stoid has only used "Fox" in a post three times and not since 2003? I would have never guessed that.
  #16  
Old 06-30-2011, 01:25 PM
johnpost johnpost is online now
Guest
 
Join Date: Jul 2009
companies that might be searched for might include

ibm
3m

dozens of past and present government agencies (in the USA)

and then there are lots of country abbreviations that have two or three letters, for example

usa
uk
prc
uae
  #17  
Old 06-30-2011, 03:36 PM
Chronos Chronos is offline
Charter Member
 
Join Date: Jan 2000
Location: The Land of Cleves
Posts: 47,932
A few more:

OSX
PS2
PS3
Wii
  #18  
Old 06-30-2011, 04:10 PM
johnpost johnpost is online now
Guest
 
Join Date: Jul 2009
there have also been messages on

dos
os2
cbm
c64

and chemicals like for example

co
co2

and standards like

nec
  #19  
Old 06-30-2011, 05:41 PM
Chronos Chronos is offline
Charter Member
 
Join Date: Jan 2000
Location: The Land of Cleves
Posts: 47,932
Really, I think it'd be easier to allow 3-letter words by default, and just expand the badwords list. Most of the problem with searching 3-letter words comes from a very small number of repeat offenders like "the", "and", and "for".
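One way to check the "small number of repeat offenders" idea is to count how often each three-letter word appears and look at the top of the list. This is only a sketch over made-up sample posts, with a hypothetical function name; real numbers would have to come from the board's own posts.

```python
import re
from collections import Counter

def short_word_counts(posts, length=3):
    """Count occurrences of every word with exactly `length` characters."""
    counts = Counter()
    for text in posts:
        words = re.findall(r"[a-z0-9]+", text.lower())
        counts.update(w for w in words if len(w) == length)
    return counts

# Made-up sample posts.
posts = [
    "The fox and the hound",
    "Looking for the DVD and the CSS file",
    "The CPU is fine for now",
]

counts = short_word_counts(posts)
# The most frequent three-letter words are candidates for the badwords list.
print(counts.most_common(3))
```

If a handful of words account for most of the occurrences, blocking just those would keep the index far smaller than indexing every three-letter word.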
  #20  
Old 07-01-2011, 04:29 PM
TubaDiva TubaDiva is offline
Mother's Little Helper
Administrator
 
Join Date: Mar 1999
Location: In the land of OO-bla-dee
Posts: 9,412
Chronos, that's not how it works. You have a point, but the code writers don't always go along with what seems to be the logical choice. YMMV, I guess.

I've added the latest suggestions. Thanks to everyone for their help.

Last edited by TubaDiva; 07-01-2011 at 04:30 PM.
All times are GMT -5.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.
Copyright © 2013 Sun-Times Media, LLC.