Limitations and Challenges in Effective Web Data Mining

Web data mining and data collection is critical processcrawlers. Modern search engine crawlers or bot can
for many business and market research firms today.not access the entire web due to bandwidth limitations.
Conventional Web data mining techniques involveThere are thousands of internet databases that can
search engines like Google, Yahoo, AOL, etc andoffer high-quality, editor scanned and well-maintained
keyword, directory and topic-based searches. Sinceinformation, but are not accessed by the crawlers.
the Web's existing structure cannot provide high-quality,Almost all search engines have limited options for
definite and intelligent information, systematic web datakeyword query combination. For example Google and
mining may help you get desired business intelligenceYahoo provide option like phrase match or exact
and relevant data.match to limit search results. It demands for more
Factors that affect the effectiveness ofefforts and time to get most relevant information.
keyword-based searches include:Since human behavior and choices change over time,
• Use of general or broad keywords on searcha web page needs to be updated more frequently to
engines result in millions of web pages, many of whichreflect these trends. Also, there is limited space for
are totally irrelevant.multi-dimensional web data mining since existing
• Similar or multi-variant keyword semantics myinformation search rely heavily on keyword-based
return ambiguous results. For an instant word pantherindices, not the real data.
could be an animal, sports accessory or movie name.Above mentioned limitations and challenges have
• It is quite possible that you may miss many highlyresulted in a quest for efficiently and effectively
relevant web pages that do not directly include thediscover and use Web resources. Send us any of
searched keyword.your queries regarding Web Data mining processes to
The most important factor that prohibits deep webexplore the topic in more detail.
access is the effectiveness of search engine