Skip to Main Content
| University Libraries
See Updates and FAQs for the latest library services updates. Subject Librarians are available for online appointments, and Virtual Reference has extended hours.

Text Analysis Tools

A companion to our Text and Data Mining Sources infoguide, this guide will take you through how to use several text analysis tools

Constellate

Constellate, a text and data analytics service from JSTOR and Portico, is a platform for learning and performing text analysis, building datasets, and sharing analytics course materials. The platform provides value to users in three core areas: they can teach and learn text analytics, build datasets from across multiple content sources, and visualize and analyze their datasets.

Getting Started with Constellate

1. Log in with JSTOR credentials or create an account.

2. Go to Constellate's dataset builder to begin. Filter your results by keyword, publication title, publication date, language, document type, provider, category, and download availability. Your dataset cannot exceed 25,000 documents because we are using the free tier of Constellate. You will see a bar graph of documents over time. Once you've filtered the results, click build.

3. You will be redirected to the Constellate dashboard. From the dashboard you have access to several tutorials for learning text analysis.

4. You can access your dataset in the datasets tab. If you click visualize you will see a visualization of trends across results and you have the ability to search within results.

5. To export your results, click download. You have multiple export options, including a CSV or JSONL sample preview of 1,500 documents; a CSV of metadata or ngrams; and a JSONL of the metadata, ngrams, and full text.