Prerequisite: Programming for Social Data Science – I
This course is a continuation of Programming for Social Data Science I and introduces students to the core principles, tools, and reasoning approaches in data science, with a specific focus on social and policy- relevant applications. It emphasizes exploratory data analysis (EDA), statistical thinking, introductory modeling paradigms, and data modeling practices essential for understanding and working with real-world datasets.
Given that much of social/policy data is textreports, tweets, interviews, news, etc.the course also includes foundational exposure to natural language processing (NLP), enabling students to interpret unstructured data as they embark on learning data science. A strong ethical framework is interwoven throughout the course to ensure students critically engage with issues of fairness, transparency, and accountability in data-driven decision-making. By integrating conceptual theory with applied analysis, the course prepares students to contribute meaningfully to data- informed governance, policy evaluation, and social research.