Databases, Knowledge Discovery, Information Retrieval, and Web MiningSusanne Hoche, Ingolf Geist, Lourdes Pena Castillo, Nadine Schulz, Databases, Knowledge Discovery, Information Retrieval, and Web Mining. Chapter in CIGR Handbook of Agricultural Engineering Volume VI Information Technology, Axel Murnack (Ed.), pp. 168–184. September 2006. No electronic version available. External information
An abundance of digital information is now available, and large investments in data collection are being made in, for instance, the area of agribusiness. A successful exploitation of the gathered data, e.g., to extract valuable information, detect useful, frequent or extraordinary patterns, or to support complex decision processes, demands powerful means for storing, accessing and analyzing data. Database Management Systems (DBMSs) provide an efficient, integrated and standardized platform for data storage and access. Knowledge Discovery in Databases (KDD) aims at the semi-automatic discovery of useful information in large data collections usually stored in databases. Information Retrieval (IR) is concerned with gathering from unstructured and semantically fuzzy data, such as natural language texts, images, audio, or video, information relevant to a user-defined query. Web mining describes techniques to extract useful information from the World Wide Web. In this section, we present an overview of the state of the art of DBMSs and the emerging fields of KDD, IR, and web mining, and relate core methodologies to agricultural applications.