The Wikipedia Miner Toolkit is for developers and researchers who want closely examine the raw structure of Wikipedia. If you are looking for a pre-built Wikipedia-based ontology, then something like DBpedia or FreeBase will probably be more relevant to you. However, if you want to make use of the structure and content of Wikipedia, then this toolkit will make your work a lot easier.

What this does

What this doesn't do

Remember, Wikipedia Miner is entirely open source, and is free to evolve as you see fit.

Requirements

To run wikipedia miner, you will need lots of hard-drive space and around 3G of memory. On top of that, the toolkit requires:

If you only need Wikipedia's structure rather than it's full textual content, then you can save a lot of time by using one of our pre-summarized dumps (available here). Otherwise, you will need:

If you want to host your own Wikipedia Miner web services, then you will also need:

Licence

The Wikipedia Miner toolkit is open-source software, distributed under the terms of the GNU General Public License.