Syllabus
Unit 1
Introduction to Data Science
Data science pipeline and components; Data science Methodologies; Concepts of Machine Learning; Principles of research design; Supervised and unsupervised learning; Biases and variances in the data; Class imbalances
Unit 2
Text as Data
Text cleaning techniques, Text annotation, Keyword analysis, Sentiment analysis, Text classification, and summarization; NLP tools; Visualization of text data
Unit 3
Predictive modelling in social science
Data cleaning techniques. Data transformation; Linear Regression; Logistic Regression; Decision Trees; Classification and categorization techniques
Unit 4
Advanced tools for data collection and analysis
Machine learning Tools (WEKA); Web Scraping; Text Analytics tools without coding (MonkeyLearn)
Text Books / References
References
- Cioffi-Revilla, C. (2014). Introduction to computational social science. Springer. https://doi.org/10.1007/978-1-4471-5661-1
- Michael J. Crawley, The R Book, Second Edition, https://www.cs.upc.edu/~robert/teaching/estadistica/TheRBook.pdf
- Larson-Hall, J. (2015). A guide to doing statistics in second language research using SPSS and R. Routledge.
- Introduction to Social Data Science by David Garcia, 2021 https://dgarcia-eu.github.io/SocialDataScience/1_Introduction/011_IntroductionToSDS/Introduction.html
- Mariani, P., & Zenga, M. (Eds.). (2020). Data science and social research II: Methods, technologies and applications. Springer Nature.