<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Komodia&#039;s URL classification service</title>
	<atom:link href="http://url.komodia.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://url.komodia.com</link>
	<description>Providing classification services for the enterprise and parental control solutions</description>
	<lastBuildDate>Sun, 20 May 2012 15:26:19 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Web Filtering Via URL Classification</title>
		<link>http://url.komodia.com/2010/06/30/web-filtering-via-url-classification/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=web-filtering-via-url-classification</link>
		<comments>http://url.komodia.com/2010/06/30/web-filtering-via-url-classification/#comments</comments>
		<pubDate>Wed, 30 Jun 2010 04:16:18 +0000</pubDate>
		<dc:creator>chris</dc:creator>
				<category><![CDATA[URL Categorization]]></category>
		<category><![CDATA[URL classification]]></category>
		<category><![CDATA[web filter]]></category>
		<category><![CDATA[web filtering]]></category>

		<guid isPermaLink="false">http://url.komodia.com/blog/?p=40</guid>
		<description><![CDATA[Companies and network administrators use web filtering capabilities to avoid employees accessing non-productive or objectionable sites.  These are sites that take valuable program resources from your network and at the same time expose you and your system to aversive actions.  This can be everything from fines and lawsuits to hackers, malware, and viruses.  Also, inappropriate [...]]]></description>
			<content:encoded><![CDATA[<p>Companies and network administrators use web filtering capabilities to avoid employees accessing non-productive or objectionable sites.  These are sites that take valuable program resources from your network and at the same time expose you and your system to aversive actions.  This can be everything from fines and lawsuits to hackers, malware, and viruses.  Also, inappropriate Internet usage drains company and business resources.</p>
<p>Hackers and their malicious programs may gain access to your network via Internet connections. This can be avoided with web filtering (through URL classification).</p>
<p>A web filter lets corporate managers and network administrators avoid employees from accessing URLs that contain content material that the organization finds objectionable. Decisions on objectionable content material are generally determined on the basis of legal or ethical concerns, employee productivity, network security, or any other issue. By using a URL classification web filter, you can ban URLs that are classified in everything from pornography to social networking.</p>
<p>Chris</p>
]]></content:encoded>
			<wfw:commentRss>http://url.komodia.com/2010/06/30/web-filtering-via-url-classification/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Network Administrator By Proxy</title>
		<link>http://url.komodia.com/2010/06/27/network-administrator-by-proxy/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=network-administrator-by-proxy</link>
		<comments>http://url.komodia.com/2010/06/27/network-administrator-by-proxy/#comments</comments>
		<pubDate>Sun, 27 Jun 2010 04:01:38 +0000</pubDate>
		<dc:creator>chris</dc:creator>
				<category><![CDATA[URL Categorization]]></category>
		<category><![CDATA[automatic url classification]]></category>
		<category><![CDATA[URL classification]]></category>
		<category><![CDATA[urls]]></category>

		<guid isPermaLink="false">http://url.komodia.com/blog/?p=37</guid>
		<description><![CDATA[Malevolent Internet websites really are a cornerstone of online criminal actions. Consequently, there&#8217;s been wide-ranging interest in creating methods to avoid the end user from going to such websites. One answer to this growing concern is automated URL classification. This method uses statistical techniques to uncover the properties of malicious and malware website URLs. These [...]]]></description>
			<content:encoded><![CDATA[<p>Malevolent Internet websites really are a cornerstone of online criminal actions. Consequently, there&#8217;s been wide-ranging interest in creating methods to avoid the end user from going to such websites.</p>
<p>One answer to this growing concern is automated URL classification. This method uses statistical techniques to uncover the properties of malicious and malware website URLs. These techniques would be capable of understanding and analyzing signs of potentially suspect URLs with very few false positives.</p>
<p>Obviously, URL classification has enormous benefits to the network adminstrator. As an adminstrator, it is always frustrating when a network user exposes the system to a threat that the administrator would never allow if he were present. URL classification allows the administrator to have a proxy presence and deny the suspicious website automatically.</p>
<p> Chris</p>
]]></content:encoded>
			<wfw:commentRss>http://url.komodia.com/2010/06/27/network-administrator-by-proxy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DIY URL Categorization</title>
		<link>http://url.komodia.com/2010/06/26/diy-url-categorization/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=diy-url-categorization</link>
		<comments>http://url.komodia.com/2010/06/26/diy-url-categorization/#comments</comments>
		<pubDate>Sat, 26 Jun 2010 07:45:50 +0000</pubDate>
		<dc:creator>chris</dc:creator>
				<category><![CDATA[URL Categorization]]></category>

		<guid isPermaLink="false">http://url.komodia.com/blog/?p=35</guid>
		<description><![CDATA[I have a friend who thinks that he can build his own URL database.  &#8220;Why should I pay somebody else to classify URL addresses for me?&#8221; he says.  This is the kind of guy who like to organize his video collection by genre and date.  His socks are all in color co-ordinated separate drawers.  He [...]]]></description>
			<content:encoded><![CDATA[<p>I have a friend who thinks that he can build his own URL database.  &#8220;Why should I pay somebody else to classify URL addresses for me?&#8221; he says.  This is the kind of guy who like to organize his video collection by genre and date.  His socks are all in color co-ordinated separate drawers.  He seems to think that the Internet is within his OCD grasp.</p>
<p>Well, since the Internet gains and loses hundreds of thousands of sites per day, I think this is just one task that my friend is not going to be able to tackle on his own.  His DIY (do it yourself) skills have certainly met their match.</p>
<p>When I called him last and asked him how he was doing, he sounded kind of frustrated.  He said he would call me back when he was done.  I am expecting that call on the first Monday of never!</p>
<p>Chris</p>
]]></content:encoded>
			<wfw:commentRss>http://url.komodia.com/2010/06/26/diy-url-categorization/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Incorrect Classification</title>
		<link>http://url.komodia.com/2010/06/24/incorrect-classification/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=incorrect-classification</link>
		<comments>http://url.komodia.com/2010/06/24/incorrect-classification/#comments</comments>
		<pubDate>Thu, 24 Jun 2010 16:36:13 +0000</pubDate>
		<dc:creator>chris</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[incorrect classification]]></category>
		<category><![CDATA[incorrect url classification]]></category>
		<category><![CDATA[mis-classification]]></category>

		<guid isPermaLink="false">http://url.komodia.com/blog/?p=33</guid>
		<description><![CDATA[Recently I was on Yahoo Answers.  I will go there occasionally when I have a practical problem of which I feel there is a simple, commonsense answer that escapes me.  I can generally find the answer to my problem within minutes of posting my question. The question I had the other day was about which [...]]]></description>
			<content:encoded><![CDATA[<p>Recently I was on Yahoo Answers.  I will go there occasionally when I have a practical problem of which I feel there is a simple, commonsense answer that escapes me.  I can generally find the answer to my problem within minutes of posting my question.</p>
<p>The question I had the other day was about which portable camping grill was the best.  I received about a dozen answers and one of them sent my to a nice little portal site that specialized in portable camping grills.  I found good reviews and very low prices.</p>
<p>The problem arose when I tried to award the answer that I liked best as the &#8220;Best Answer.&#8221;  For some reason my security software had that page of Yahoo Answers classified as a computer threat.  When I scanned the page, there were no problems, but my security software was not to be persuaded.</p>
<p>That is one potential limitation of URL categorization:  URL mis-classification.  If the URL is classified incorrectly, then potentially every page of the site can be as well.</p>
<p>The other side of this coin is a similar problem.  If the site is fine when it is first classified but then becomes infected at a later date, then you will have an undetected malware site within your categorization as a clean site.</p>
<p>Chris</p>
]]></content:encoded>
			<wfw:commentRss>http://url.komodia.com/2010/06/24/incorrect-classification/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Empty URLs</title>
		<link>http://url.komodia.com/2010/06/22/empty-urls/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=empty-urls</link>
		<comments>http://url.komodia.com/2010/06/22/empty-urls/#comments</comments>
		<pubDate>Tue, 22 Jun 2010 07:32:13 +0000</pubDate>
		<dc:creator>chris</dc:creator>
				<category><![CDATA[URL Categorization]]></category>
		<category><![CDATA[empty url]]></category>
		<category><![CDATA[empty urls]]></category>
		<category><![CDATA[word noise]]></category>

		<guid isPermaLink="false">http://url.komodia.com/blog/?p=29</guid>
		<description><![CDATA[I have briefly discussed the problem of word noise in a previous post.  This is when a text based classification system is stymied by too much content.  An over-abundance of content &#8211; especially content from varying topics &#8211; creates an impossibility for classification.  If there is business related content and video game related content and gardening [...]]]></description>
			<content:encoded><![CDATA[<p>I have briefly discussed the problem of word noise in a previous post.  This is when a text based classification system is stymied by too much content.  An over-abundance of content &#8211; especially content from varying topics &#8211; creates an impossibility for classification.  If there is business related content and video game related content and gardening related content and sports related content, how in the world can you accurately classify the site?</p>
<p>Well, empty URLs create the opposite (but equal) problem for URL classification.  How could a URL classifying software program correctly classify a site such as wbj.org or wsj.com as a business site based wholly on the URL.  It can&#8217;t.  And the reason is because the URL gives no evidence to its true content.  This is what is called an empty URL.</p>
<p>However, most of your empty URLs will still have other classifiable pages that are indexed.  wbj.org/business-in-washington is an example.  So, even your empty URLs can be classified eventually &#8211; once other pages from within the site are classified themselves. </p>
<p>Still, it does warrant mentioning that empty URLs are a potential limitation of the URL classifying method.</p>
<p>Chris</p>
]]></content:encoded>
			<wfw:commentRss>http://url.komodia.com/2010/06/22/empty-urls/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Port Blocking &amp; URL Categorization</title>
		<link>http://url.komodia.com/2010/06/20/port-blocking-url-categorization/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=port-blocking-url-categorization</link>
		<comments>http://url.komodia.com/2010/06/20/port-blocking-url-categorization/#comments</comments>
		<pubDate>Sun, 20 Jun 2010 06:17:05 +0000</pubDate>
		<dc:creator>chris</dc:creator>
				<category><![CDATA[URL Categorization]]></category>
		<category><![CDATA[port blocking]]></category>

		<guid isPermaLink="false">http://url.komodia.com/blog/?p=27</guid>
		<description><![CDATA[A network administer may wish to stop users on his network from accessing certain kinds of sites.  For instance, if all of your users are spending a great deal of time on Facebook, then an administrator may wish to block this particular site or &#8220;social networking&#8221; sites in general. Port blocking and URL categorization is [...]]]></description>
			<content:encoded><![CDATA[<p>A network administer may wish to stop users on his network from accessing certain kinds of sites.  For instance, if all of your users are spending a great deal of time on Facebook, then an administrator may wish to block this particular site or &#8220;social networking&#8221; sites in general.</p>
<p>Port blocking and URL categorization is a way to effectively accomplish this.  URL categorization creates a database of URL addresses and these can be classified in a multitude of ways.  Facebook, as an example, may be classified as &#8220;social networking.&#8221;   By using port blocking, the administrator could block all sites in the &#8220;social networking&#8221;category.  This would stop all visits to every site within that category (like MySpace) - and not just Facebook.</p>
<p>The limitation, of course, would be in any site that is not yet classified or is not classified within the &#8220;social networking&#8221; category.</p>
<p>Chris</p>
]]></content:encoded>
			<wfw:commentRss>http://url.komodia.com/2010/06/20/port-blocking-url-categorization/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>One Limitation of URL Classification</title>
		<link>http://url.komodia.com/2010/06/18/one-limitation-of-url-classification/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=one-limitation-of-url-classification</link>
		<comments>http://url.komodia.com/2010/06/18/one-limitation-of-url-classification/#comments</comments>
		<pubDate>Fri, 18 Jun 2010 05:56:34 +0000</pubDate>
		<dc:creator>chris</dc:creator>
				<category><![CDATA[URL Categorization]]></category>
		<category><![CDATA[limitation of url classification]]></category>
		<category><![CDATA[url categorization limitations]]></category>

		<guid isPermaLink="false">http://url.komodia.com/blog/?p=24</guid>
		<description><![CDATA[It is true that by creating a large database of malicious URL addresses, URL classification can allow a network administrator the power to completely block a malicious or suspicious URL.  There is, of course, one small limitation of this method: What happens when a user on your network is the first to stumble upon a [...]]]></description>
			<content:encoded><![CDATA[<p>It is true that by creating a large database of malicious URL addresses, URL classification can allow a network administrator the power to completely block a malicious or suspicious URL.  There is, of course, one small limitation of this method:</p>
<p>What happens when a user on your network is the first to stumble upon a malicious or infect site?</p>
<p>Well, since there is no record of the site within the category of malware or malicious sites, the URL is allowed.  Your network has just become a guinea pig.  Now that you have suffered this attack, the bad URL is correctly classified under malware and every subsequent visit is blocked.</p>
<p>Thanks to your network&#8217;s sacrifice, all future visitors are forewarned and saved.  Unfortunately, that will not help you one single iota.</p>
<p>I am not suggesting that URL classification is flawed in the least, I just wanted to post information regarding the one particular limitation.</p>
<p>Chris</p>
]]></content:encoded>
			<wfw:commentRss>http://url.komodia.com/2010/06/18/one-limitation-of-url-classification/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What Can Be Done With URL Categorization?</title>
		<link>http://url.komodia.com/2010/06/16/what-can-be-done-with-url-categorization/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=what-can-be-done-with-url-categorization</link>
		<comments>http://url.komodia.com/2010/06/16/what-can-be-done-with-url-categorization/#comments</comments>
		<pubDate>Wed, 16 Jun 2010 20:04:40 +0000</pubDate>
		<dc:creator>chris</dc:creator>
				<category><![CDATA[URL Categorization]]></category>
		<category><![CDATA[URL classification]]></category>

		<guid isPermaLink="false">http://url.komodia.com/blog/?p=21</guid>
		<description><![CDATA[The Internet is growing at an alarming rate.  Typically, ULR filtering has been traditional with an eye on history and semi permanent classification.  This is beginning to change.  So much more can be done with URL classification today.  Web content can be classified as you go now.  This way you get URL classification that is [...]]]></description>
			<content:encoded><![CDATA[<p>The Internet is growing at an alarming rate.  Typically, ULR filtering has been traditional with an eye on history and semi permanent classification.  This is beginning to change. </p>
<p>So much more can be done with URL classification today.  Web content can be classified as you go now.  This way you get URL classification that is immediate and that lives up to the required need of the individual.  It is important to have URL classification in current time.  This helps to find the right classification slot as well. </p>
<p>You can get link analysis and classification.  You can get protection from unwanted content online also.  Site history can be checked to see if it is reputable.  Malware can be detected as part of this as can any software that could harm your computer. </p>
<p>Do you want your URL classification and categorization system to build a database for you as you search in real time?  This is the kind of benefits you can now expect.</p>
<p>Chris</p>
]]></content:encoded>
			<wfw:commentRss>http://url.komodia.com/2010/06/16/what-can-be-done-with-url-categorization/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Given Only the URL, Can We Identify the Topic?</title>
		<link>http://url.komodia.com/2010/06/14/given-only-the-url-can-we-identify-the-topic/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=given-only-the-url-can-we-identify-the-topic</link>
		<comments>http://url.komodia.com/2010/06/14/given-only-the-url-can-we-identify-the-topic/#comments</comments>
		<pubDate>Mon, 14 Jun 2010 09:28:52 +0000</pubDate>
		<dc:creator>chris</dc:creator>
				<category><![CDATA[URL Categorization]]></category>
		<category><![CDATA[url identification]]></category>

		<guid isPermaLink="false">http://url.komodia.com/blog/?p=18</guid>
		<description><![CDATA[Given just the URL of a website, is it possible to identify the website&#8217;s topic?  And if so, why use this method? Certainly you can identify a web page  by its content.  You can use the actual text or hypertext or incoming/outgoing links or a combination of all these and others.  So if we follow [...]]]></description>
			<content:encoded><![CDATA[<p>Given just the URL of a website, is it possible to identify the website&#8217;s topic?  And if so, why use this method?</p>
<p>Certainly you can identify a web page  by its content.  You can use the actual text or hypertext or incoming/outgoing links or a combination of all these and others.  So if we follow the old adage: &#8220;If it aint broke, don&#8217;t fix it,&#8221; why would we want to use URL classification?</p>
<p>URL identification is the preferred classification when speed is a critical issue or when you want to alter website content before it is downloaded if it is of a questionable nature.  It also helps when website images can&#8217;t be seen for content analysis.  There are also several other reasons &#8211; depending on your needs.</p>
<p>URL classification is a great tool and will become more widely used in the future.</p>
<p>Chris</p>
]]></content:encoded>
			<wfw:commentRss>http://url.komodia.com/2010/06/14/given-only-the-url-can-we-identify-the-topic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Word Noise and Site Classification</title>
		<link>http://url.komodia.com/2010/06/12/word-noise-and-site-classification/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=word-noise-and-site-classification</link>
		<comments>http://url.komodia.com/2010/06/12/word-noise-and-site-classification/#comments</comments>
		<pubDate>Sat, 12 Jun 2010 09:25:56 +0000</pubDate>
		<dc:creator>chris</dc:creator>
				<category><![CDATA[URL Categorization]]></category>
		<category><![CDATA[site classification]]></category>
		<category><![CDATA[text summarization]]></category>
		<category><![CDATA[website categorization]]></category>
		<category><![CDATA[word noise]]></category>

		<guid isPermaLink="false">http://url.komodia.com/blog/?p=16</guid>
		<description><![CDATA[In a previous post I discussed the problems that traditional website categorization methods have with sites that lack text or links or anchor text.  Here is the other extreme:  word noise. Some websites simply have too much content that is all over the spectrum.  There is so much text, so much content, that the categorization [...]]]></description>
			<content:encoded><![CDATA[<p>In a previous post I discussed the problems that traditional website categorization methods have with sites that lack text or links or anchor text.  Here is the other extreme:  word noise.</p>
<p>Some websites simply have too much content that is all over the spectrum.  There is so much text, so much content, that the categorization of the website is muddled.  This phenomena is known as word noise, too many words.</p>
<p>It is true that some form of text summarization techniques have proven to be useful in these types of situations; however, an equally effective (if not more so) and less expensive method for classification in this situation would be URL categorization.</p>
<p>You may wonder if an incredibly large URL address might be considered a form of word noise when employing this form of classification.  Yet, even your large and ungainly website URL addresses are generally broken up into your root and then either post names, categories, archives, pages, blog titles, and tags.  Rather than creating word noise, these website URL practically categorize themselves due in large part to modern SEO techniques.</p>
<p>Chris</p>
]]></content:encoded>
			<wfw:commentRss>http://url.komodia.com/2010/06/12/word-noise-and-site-classification/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: basic
Database Caching using disk: basic
Object Caching 554/675 objects using disk: basic

Served from: url.komodia.com @ 2012-05-27 02:46:02 -->
