McGill Library works with vendors whenever possible to include text and data mining into future agreements and can help negotiate access for specific projects. Some licensed databases that McGill has negotiated text data mining rights for are listed below.
If you would like access to any of these collections for text mining purposes, please contact your Liaison Librarian for assistance.
Large digital archives and publishers are increasingly making large corposus of text available for researchers to text mine. Here is a select list of sources that make text corpora freely available.
McGill's.txtLAB has metadata and full text data sets available for download here.