This Internet MiniGuide Annotated Link Compilation is dedicated to the latest and most reliable resources for knowledge discovery available on the internet. With the addition of new and pertinent information added online continuously, it is very easy to experience information overload. The key is to be able to find the important knowledge discovery resources and sites both in the visible and invisible World Wide Web. The following selected knowledge discovery resources and sites offer a range of knowledge and information discovery sources to help you accomplish your research. Also, visit the Knowledge Discovery Subject Tracer Information Blog for updates. Other white papers and subject tracers by Marcus P. Zillman are available by clicking here.

ACM SIGKDD: Current Explorations Issuehttp://www.acm.org/sigs/sigkdd/explorations/issue.php?issue=current Explorations is published twice yearly, in June/July and in December/January each year. The newsletter is distributed in hardcopy form to all members of the ACM SIGKDD. It is also sent to ACM’s network of libraries. Online versions are available on the web free to the general public. Their goal is to make the SIGKDD Newsletter an informative, rapid means of publication and dynamic forum for communication with the Knowledge Discovery and Data Mining community.

Advanced Knowledge Technologieshttp://www.aktors.org/ The Advanced Knowledge Technologies (AKT) project aims to develop and extend a range of technologies providing integrated methods and services for the capture, modelling, publishing, reuse and management of knowledge. AKT is a multi-million pound , six year collaboration between internationally recognized research groups at the Universities of Aberdeen, Edinburgh, the Open University, Sheffield and South Hampton.

APECKS: a Tool to Support Living Ontologieshttp://ksi.cpsc.ucalgary.ca/KAW/KAW98/tennison/index.html Ontology servers are currently under-developed in terms of the support they provide for collaborative activities on their content. This paper presents the APECKS (Adaptive Presentation Environment for Collaborative Knowledge Structuring) system, an ontology server which supports collaboration by allowing individuals to create personal ontologies. These ontologies can be compared with others’ to prompt discussion about the sources of their differences and similarities.

Association of KnowledgeWork (AOK)http://www.kwork.org/index.html At the Association of Knowledgework, people from every specialty cross professional, geographic, cultural, economic and hierarchical barriers to learn together. Not just another website, this is a virtual home for those who work with knowledge.

BAYESIA: Bayesian Networks and Data Mining Toolhttp://www.bayesia.com/ The Bayesian Network approach merges and supersedes existing approaches coming from Artificial Intelligence and Data Mining, both symbolic and statistical ones. Bayesian Networks are rigorously justified, provide a distributed knowledge representation, and are as understandable as a rule base. They deal particularly well with uncertainty, and they can be manually generated by consultation of an expert, or inductively built by machine learning.

Bibliomining Information Center – Data Mining for Librarieshttp://www.bibliomining.com/ The basic definition is “data mining for libraries.” For years, bibliometrics has been used to track patterns in authorship, citation, etc. Today, there are many more tools available for discovering similar patterns in complex datasets from data mining and statistics. In addition, tools from management science such as Online Analytical Processing (OLAP) can be used to explore the data for patterns. Therefore, a more complex definition is: Bibliomining is the combination of data mining, bibliometrics, statistics, and reporting tools used to extract patterns of behavior-based artifacts from library systems.

Brint.com – Business Technology – Information Economy – Knowledge Managementhttp://km.brint.com/ KMNetwork and the WWW Virtual Library of Knowledge Management combined to bring together an excellent resource for research papers and portals on knowledge management and discovery. In depth research articles and research portals from over the entire global Internet discuss the business, technologies, processes, systems, sociology, creativity, psychology and philosophy of Knowledge Management.

Creative Commons RDF-Enhanced Searchhttp://search.creativecommons.org/ This search engine will help you find photos, music, text, books, educational material, and more that is free to share or build upon. Copyright applies fully and automatically to any work. a photograph, a song, a web page, an article, pretty much any form of expression, the moment it is created. This means that if you want to copy and re-use a creative work you find online, you usually have to ask the author’s permission. This “all rights reserved” protection is good thing for many authors and artists. But what about those who want you to use their work freely without permission — but on certain conditions? This search engine helps you quickly find those authors and the work they have marked as free to use with only “some rights reserved.” If you respect the rights they have reserved (which will be clearly marked, as you’ll see) then you can use the work without having to contact them and ask. In some cases, you may even find work in the public domain — that is, free for any use with “no rights reserved.”

Conceptual Graphshttp://conceptualgraphs.org/ Conceptual graphs (CGs) are a system of logic based on the existential graphs of Charles Sanders Peirce and the semantic networks of artificial intelligence. They express meaning in a form that is logically precise, humanly readable, and computationally tractable. With their direct mapping to language, conceptual graphs serve as an intermediate language for translating computer-oriented formalisms to and from natural languages. With their graphic representation, they serve as a readable, but formal design and specification language. CGs have been implemented in a variety of projects for information retrieval, database design, expert systems, and natural language processing.

D2K – Data To Knowledgehttp://alg.ncsa.uiuc.edu/do/tools/d2k D2K – Data to Knowledge is a rapid, flexible data mining and machine learning system that integrates analytical data mining methods for prediction, discovery, and deviation detection, with data and information visualization tools. It offers a visual programming environment that allows users to connect programming modules together to build data mining applications and supplies a core set of modules, application templates, and a standard API for software component development.

Explore Open Archiveshttp://opcit.eprints.org/explorearchives.shtml This site lists and comments on other lists of individual open archives. This list and its categorisation gives a broad overview of the structure, size and progress of full-text open access eprint archives. This list will be maintained and updated as far as is possible, and is intended to assist further quantitative research on the open access eprint phenomenon for those who want to measure the growth and quality of open access eprint archives.

Global Knowledge Partnership (GKP)http://www.globalknowledge.org/ The Global Knowledge Partnership (GKP) is a worldwide network committed to harnessing the potential of information and communication technologies (ICTs)* for sustainable and equitable development. GKP’s vision is a world of equal opportunities where all people can access and use knowledge and information to improve their lives. The network enables the sharing of information, experiences and resources to help reduce poverty and empower people.

GMDH – Group Method of Data Handlinghttp://come.to/GMDH Group Method of Data Handling was applied in a great variety of areas for data mining and knowledge discovery, forecasting and systems modeling, optimization and pattern recognition. Inductive GMDH algorithms give possibility to find automatically interrelations in data, to select optimal structure of model or network and to increase the accuracy of existing algorithms. This original self-organizing approach is substantially different from deductive methods used commonly for modeling. It has inductive nature – it finds the best solution by sorting-out of possible variants.

Gurteen Knowledge Websitehttp://www.gurteen.com/ This site acts as a gateway Knowledge Management, Learning, Thinking, Creativity, Personal Mastery; Personal Knowledge Management and the effective use of Technology. The site was created by and is maintained by David Gurteen, an UK-based knowledge consultant.

IBM Research – Knowledge Discovery & Data Mining (KDD)http://www.research.ibm.com/compsci/kdd/ IBM Research has been at the forefront of the exciting new area of Knowledge Discovery & Data Mining (KDD) from the very beginning. Key advances in robust and scalable data mining, methods for fast pattern detection from very large databases, text and web mining, and innovative business intelligence applications have come from our research laboratories. Links to their current projects as well as KDD links are available from this site.

Insightful Miner 3http://www.insightful.com/products/iminer/default.asp Insightful Miner is a highly scalable data mining and analysis workbench that gives new analysts and skilled modelers the ability to deploy predictive intelligence throughout the enterprise. Insightful Miner 3 increases support for large data environments with new versions for Windows and Solaris servers and adds many new features that allow data analysts and data miners to easily build and deploy analytic applications that boost product performance and improve the efficiency of critical business processes.

International Workshop on Peer-to-Peer Knowledge Management (P2PKM)http://www.p2pkm.org/ The P2PKM workshop is intended to serve as an active forum for researchers and practitioners, where they will have the possibility to exchange and discuss novel ideas, research results and experiences, laying in the intersection of the P2P, Knowledge Management (KM), Semantic Web, databases, pervasive computing, agents, as well as other related fields.

KBL(sm): A Registry of Library Knowledge Baseshttp://www.public.iastate.edu/~CYBERSTACKS/KBL.htm Library-created or library-related Knowledge Bases. A Knowledge Base / Knowledgebase may be defined as a database with a focus on empirical or practical knowledge. In recent years, Knowledge bases have become common components for many businesses and services.

KDD-2009: The 15th Annual ACM SIGKDD International Conference on Knowledge Discovery and Data Mininghttp://www.acm.org/sigs/sigkdd/kdd2009/ The 15th annual ACM SIGKDD conference is the premier international forum for data mining researchers and practitioners from academia, industry, and government to share their ideas, research results and experiences. KDD-09 will feature keynote presentations, oral paper presentations, poster sessions, workshops, tutorials, panels, exhibits, demonstrations, and the KDD Cup competition.

KDnuggets: Data Mining, Web Mining, and Knowledge Discovery Guidehttp://www.kdnuggets.com/ KDnuggets.com (KD stands for Knowledge Discovery) is the leading source of information on Data Mining, Web Mining, Knowledge Discovery, and Decision Support Topics, including News, Software, Solutions, Companies, Jobs, Courses, Meetings, Publications, and more. KDnuggets News has been widely recognized as the leading newsletter on Data Mining and Knowledge Discovery.

KmBloggerhttp://kmwiki.wikispaces.com/A resource covering site, resources, communities and related information about the relationship between blogs and knowledge management and knowledge discovery.

Know-Center – Austria’s Competence Center for Knowledge Managementhttp://www.know-center.at The Know-Center is Austria’s Competence Center for knowledge-based Applications and Systems. The Know-Center has its core competences in the fields of information technology as enabling technologies for knowledge management and in human-oriented knowledge management.

Knowledge Discoveryhttp://www.KnowledgeDiscovery.info/ A Subject Tracer™ Information Blog developed and created by Internet expert, author, keynote speaker and consultant Marcus P. Zillman, M.S., A.M.H.A. for monitoring knowledge discovery resources and sites on the Internet.

Knowledge Harvestinghttp://www.KnowledgeHarvesting.org/ Knowledge Harvesting is used to rapidly convert top-performer expertise into knowledge assets that enhance corporate valuation and protect the organization from knowledge degradation. The purpose of this site is to offer an extensive introduction to Knowledge Harvesting.

Knowledge Management Magazine – Inside Knowledgehttp://www.kmmagazine.com/ The original knowledge management publication. The knowledge that exists within your organisation is your only sustainable source of competitive advantage. They believe this makes knowledge management a strategic imperative for you. Each issue of Inside Knowledge is designed to provide you with the information you require to: 1) Learn from the mistakes and success stories of others, 2) Lower business costs and increase productivity across your organization, 3) Ensure the ongoing professional development of yourself and your colleagues, and 4) Keep on top of industry developments, new techniques and tools for knowledge management and knowledge discovery.

Knowledge Management Research Center – CIOhttp://www.cio.com/topic/1467/Knowledge_Management Making the most of intellectual capital. Topics in the Knowledge Management Research Center include: 1) Overview, 2) Strategy, 3) Process, 4) Measurement, 5) Technology, 6) Portal and Collaboration, 7) In the Know, 8) Case Studies, 9) Metrics, 10) CIO Radio, 11) Q&A, 12) Books, 13) Events, and 14) Newsletters. A comprehensive research center presented and updated by CIO. Knowledge Management Resource Center http://www.kmresource.com/ Knowledge Management Resource Center is a gateway to the world of Knowledge Management (KM). On this site you’ll find a comprehensive collection of KM resources, each reviewed and described to help you quickly locate what you’re looking for. You can explore knowledge management in their 17 departments, browse their bookstore, or search the site by keyword.

Linguistic Tools for Knowledge Discoveryhttp://www.montague.com/abstracts/discovery.htm The gaps between subject and functional boundaries are one of the best sources of breakthrough innovation. Yet for a variety of reasons — managerial, technical, and editorial — it’s often difficult to exploit them. In this article they use an example from their own research and experience to show how linguistic tools such as thesauri, glossaries, and navigation schemes can promote knowledge discovery by exposing potential linkages between seemingly unrelated subjects.

my Knowledge Explorer (mKE) and the mKR Languagehttp://mKRmKE.org/ my Knowledge Explorer (MKE) is an interactive tool for organizing knowledge. It helps the user to record, change and search knowledge, and provides extensive error checking to ensure the internal consistency of the knowledge. Interaction with mKE uses the mKR language. mKR is a very-high-level knowledge representation language with simple English-like statements, questions and commands, plus UNIX-shell-like variables, methods and control structures.

Megaputer Intelligencehttp://www.megaputer.com/Megaputer Intelligence Inc., a Delaware corporation established in May of 1997, is a leading developer and distributor of advanced software tools for data mining, text mining, and intelligent e-commerce personalization. Their tools help reveal knowledge hidden in data. They add intelligence and insight to every step of the business decision-making process. The mission of Megaputer is to provide customers around the world with top quality software tools for transforming raw data into knowledge and facilitating better business decisions.

T2K – Text to Knowledgehttp://alg.ncsa.uiuc.edu/do/tools/t2k The T2K (Text to Knowledge) tool provides text mining and analysis capabilities that have been specially designed to operate in and capitalize upon the complexity of rich natural language domains of very large stores of text and multimedia documents. T2K is a library of D2K modules that implements sophisticated algorithms for text analysis.

Telemakus – Mining and Mapping Research Findings to Promote Knowledge Discoveryhttp://www.telemakus.net/ The goal of the Telemakus System is to enhance the knowledge discovery process by developing retrieval, visual and interaction tools to mine and map research findings from the research literature. The objective of the research is to create, test and validate an infrastructure to permit the automation of the creation and maintenance of a searchable database that generates knowledge maps via query tools and concept mapping algorithms. We will also be applying natural language processing models and information analysis methods to ultimately speed up the scientific discovery process.

The Data MineThe Data Mine was launched in April 1994, to provide information about DataMining (AKA KnowledgeDiscoveryInDatabases or KDD). There are 6 separate DataMining topic areas (known as “webs”), each with an index. You could also start with the Introduction To Data Mining. Popular pages include: OnLine Analytical Processing (OLAP), Data Mining Journals, Data Mining Tutorials, Data Sources. Topic areas include: Data Mining Software, Data Mining Index, Data Mining General/Misc, People Working in Data Mining, and Data Mining Companies and Organizations.

The Protégé ProjectProtégé is an ontology editor and a knowledge-base editor. Protégé is also an open-source, Java tool that provides an extensible architecture for the creation of customized knowledge-based applications. Protégé’s OWL Plug-in now provides support for editing Semantic Web ontologies.

Visual Analytics – VisuaLinks, Link Analysis, Data Mining SoftwareVisuaLinks® is a platform-independent, graphical analysis tool used to discover patterns, trends, associations and hidden networks in any number and type of data sources. VisuaLinks presents data graphically uncovering underlying relationships and patterns. VisuaLinks addresses the entire analytical process – from access and integration to presentation and reporting – providing a single and complete solution to a broad range of data analysis needs.

UCI Knowledge Discovery in Databases Archivehttp://kdd.ics.uci.edu/ This is an online repository of large data sets which encompasses a wide variety of data types, analysis tasks, and application areas. The primary role of this repository is to enable researchers in knowledge discovery and data mining to scale existing and future data analysis algorithms to very large and complex data sets. The archive is intended to serve as a permanent repository of publicly-accessible data sets for research in KDD and data mining.http://www.visualanalytics.com/ VisuaLinks® is a platform-independent, graphical analysis tool used to discover patterns, trends, associations and hidden networks in any number and type of data sources. VisuaLinks presents data graphically uncovering underlying relationships and patterns. VisuaLinks addresses the entire analytical process – from access and integration to presentation and reporting – providing a single and complete solution to a broad range of data analysis needs.

Sabrina is also Researcher/Author of
beSpacific® - Accurate research surfacing documents and resources focused on law, technology, government reports, and knowledge discovery - with a global perspective. Updated daily since 2002 with a searchable database of 40,000 postings.