I have briefly discussed the problem of word noise in a previous post. This is when a text based classification system is stymied by too much content.  An over-abundance of content – especially content from varying topics – creates an impossibility for classification. If there is business related content and video game related content and gardening […]