Open-source data science tools for students and scholars
Everything here will run on free or open source software, and will run on a reasonably modern laptop or desktop. I'll try to illustrate my code with interesting real data; you can choose to read or code-along.
Please install R and RStudio on your machine, in that order. Welcome to programming, we'll be doing a lot of copy + paste.
Recommended guides to complement the content on this page:
Requirements: An internet connection and a Twitter developer account with an approved Academic Track project.
Using an LDA Topic Model to summarize and visualize thematic trends in a longitudinal single author archival text data set (Elon Musk Tweets 2010-2022)
Requirements: A tabular text data set, with one row per document, and all the text in one column.
Requirements: A tabular text data set, with one row per document, and all the text in one column (Analysis dataset provided in-page)
To cite this page for code or research methodology, please use:
Bhardwaj, A. (2022) Code Library: Open-source data science tools for students and scholars. https://www.abhardwaj.net/code