DIY URL Categorization

June 26th, 2010

I have a friend who thinks that he can build his own URL database.  “Why should I pay somebody else to classify URL addresses for me?” he says.  This is the kind of guy who likes to organize his video collection by genre and date.  His socks are all in separate, color-coordinated drawers.  He […]

Port Blocking & URL Categorization

June 20th, 2010

A network administrator may wish to stop users on his network from accessing certain kinds of sites.  For instance, if users are spending a great deal of time on Facebook, the administrator may wish to block this particular site or “social networking” sites in general. Port blocking and URL categorization is […]

What Can Be Done With URL Categorization?

June 16th, 2010

The Internet is growing at an alarming rate.  Traditionally, URL filtering has relied on historical data and semi-permanent classification.  This is beginning to change.  So much more can be done with URL classification today.  Web content can now be classified as you go.  This way you get URL classification that is […]

Word Noise and Site Classification

June 12th, 2010

In a previous post I discussed the problems that traditional website categorization methods have with sites that lack text, links, or anchor text.  Here is the other extreme:  word noise. Some websites simply have too much content, and it is all over the spectrum.  There is so much text, so much content, that the categorization […]

All Sites Have a URL Address

June 10th, 2010

What do you do with a site that has no text?  If a site has nothing but images, how would you classify this website using traditional methods?  Would a text-based classification even recognize the site at all? How do you use anchor text classification on a site that has no anchor text?  If the […]

URL Classification Is Truly Impressive

June 8th, 2010

I cannot tell you how impressed I am with the concept of URL categorization.  The more that I research this topic, the more impressed I become. The idea that classifying a website based solely on its URL is a worthwhile endeavor seems foolish at first hearing.  However, I have seen that not only is this […]
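To make the idea of classifying a site from its URL alone a little more concrete, here is a minimal sketch in Python.  It is only an illustration, not the method this post goes on to describe: the keyword lists, the category names, and the function names `url_tokens` and `categorize_url` are all made up for the example, and a real system would presumably learn its vocabulary from labelled data rather than from a hand-written table.

```python
import re
from urllib.parse import urlparse

# Hypothetical keyword lists, invented for this example only.
CATEGORY_KEYWORDS = {
    "social networking": {"facebook", "twitter", "myspace"},
    "shopping": {"shop", "store", "ebay", "cart"},
    "payments": {"paypal", "pay", "checkout"},
}


def url_tokens(url):
    """Split the host and path of a URL into lowercase alphabetic tokens."""
    # urlparse only fills in the host when a scheme is present.
    parsed = urlparse(url if "://" in url else "http://" + url)
    text = (parsed.netloc + " " + parsed.path).lower()
    return re.findall(r"[a-z]+", text)


def categorize_url(url):
    """Return the category whose keywords match the most tokens, or 'unknown'."""
    tokens = url_tokens(url)
    scores = {
        category: sum(1 for token in tokens if token in keywords)
        for category, keywords in CATEGORY_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "unknown"


if __name__ == "__main__":
    for example in ("http://www.facebook.com/profile", "example.org/about"):
        print(example, "->", categorize_url(example))
```

Nothing here fetches the page itself; the guess rests entirely on the words that appear in the address, which is why a site with no text or no anchor text can still be given a category.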

Uniform Resource Locator

June 6th, 2010

Webpage classification is done using a multitude of factors.  The most obvious, of course, is text or content.  The second most obvious is going to have to be the title and tags, followed closely by links and link structure.  There are many other things that can be considered, however.  There are the pages that point at […]

Website Turnaround

June 4th, 2010

If you have been on the Internet for more than a minute, you know that big changes happen routinely and that the cyberworld is constantly in flux.  Remember when PayPal introduced the ridiculous notion of a secure third-party payment program?  Or how about when eBay pretended that an online flea market was a good idea?  […]

URL Categorization

June 2nd, 2010

Are you the kind of person who has a good many DVDs and/or Blu-ray discs?  Have you ever sat down to organize your movie collection?  When I was a kid I would try to keep my music cassette tapes organized – the first Beastie Boys album was my prized possession.  I started to collect […]