making interest searching "smarter" (once it becomes available again)

Essentially, I'd like the "interests" feature to be able to parse out redundancies, specifically words like "the". A spell checker would be cool, too, but not essential. The examples below better illustrate what I'm talking about.

Let's say that lj-user A has "the simpsons" as one of his/her interests, and is looking for other Simpsons fans. If lj-user B has just "simpsons" as an interest, however, user A won't find user B. Similarly, users who have "sifl and olly" as an interest will not easily find users with "sifl-n-olly" or "sifl & olly". And of course, if someone meant to write "the beatles" but instead wrote "the beetels", they won't be easily found, either. Deliberate misspellings, though, would be harder to deal with...
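To make the idea concrete, here is a rough sketch of what the matching could look like. This is not how LiveJournal actually stores or searches interests; the function names, the list of words to drop, and the 0.7 similarity cutoff are all made up for illustration. It uses only Python's standard library.

```python
import re
from difflib import SequenceMatcher

# Hypothetical list of leading words to ignore; a real list would need tuning.
LEADING_STOPWORDS = {"the", "a", "an"}


def normalize_interest(raw: str) -> str:
    """Reduce an interest to a canonical form so that variants compare equal."""
    text = raw.lower().strip()
    # Treat "&" as the word "and", and hyphens as spaces.
    text = text.replace("&", " and ").replace("-", " ")
    # Drop any remaining punctuation.
    text = re.sub(r"[^a-z0-9\s]", "", text)
    # A lone "n" between words (as in "sifl-n-olly") is read as "and".
    words = ["and" if w == "n" else w for w in text.split()]
    # Drop a leading article, but never empty out a one-word interest.
    if len(words) > 1 and words[0] in LEADING_STOPWORDS:
        words = words[1:]
    return " ".join(words)


def looks_similar(a: str, b: str, threshold: float = 0.7) -> bool:
    """Loose spelling check; the threshold is an arbitrary assumption."""
    ratio = SequenceMatcher(None, normalize_interest(a), normalize_interest(b)).ratio()
    return ratio >= threshold


if __name__ == "__main__":
    for interest in ["the simpsons", "Simpsons", "sifl-n-olly", "sifl & olly"]:
        print(repr(interest), "->", repr(normalize_interest(interest)))
    print(looks_similar("the beatles", "the beetels"))
```

With this, "the simpsons" and "simpsons" end up as the same string, as do "sifl and olly", "sifl-n-olly", and "sifl & olly". The fuzzy comparison is the riskier part: a loose cutoff catches "beetels" but could also lump together interests that really are different, and it has no way of knowing when a misspelling is deliberate.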

  • It would make searching by interest a lot easier.
  • When the "search by common interests" feature returns, it would obviously provide more accurate results.
  • Would spell checking have to be involved? That would also bring up the issue of:
  • Deliberate misspellings
  • Would this make the interest feature, which is already disabled because of overuse, even more difficult to maintain?
  • I don't really know anything about programming, so I have no idea what this would entail. But search engines like Google and Yahoo will often throw out words like "the" when performing searches, so I know it's not totally impossible.
  • Tags: interests, searches, § historical
