The course will focus on conducting exploratory data analysis and finding typologies in the data using cluster analysis and latent class modelling. The course will consist of a theoretical part that in which students will learn how cluster analysis and latent class modelling work and of a practical part in which students will apply these methods (to their own data) using SPSS and R.
FSS - Finding Typologies in Data
Dr. Paulina Pankowska (p.k.pankowska@uu.nl)
Course description
Course objectives:
- Learn how to conduct exploratory data analysis and find typologies in the data using cluster analysis and latent class modelling.
- Learn how to use obtained typologies in further analysis (e.g., to establish associations between individual- characteristics and cluster membership).
- Focus on cluster analysis in SPSS and latent variable modelling (specifically latent profile analysis – LPA) in R.
Course content:
The course will consist of 10 meetings.
An overview of the topics covered in each meeting is provided below:
- (1st meeting: Introduction to R, if needed)
- 1st meeting: An introduction to cluster analysis (incl. general description, applications, and distinction between different clustering algorithms – i.e. deterministic vs. probabilistic and connectivity-based, centroid-based, distribution-based vs. density-based).
- 2nd meeting: Running k-means clustering (using the PhD’s own data or a dataset available online e.g., World Values Survey)
- 3rd meeting: Running hierarchical clustering (again, using the PhD’s own data or a dataset available online)
- 4th meeting: Running robustness checks and sensitivity analyses for k-means and hierarchical clustering results; addressing issues and errors
- 5th meeting: Running a multinomial logistic regression analysis using cluster memberships as dependent variable and a set of (individual-level) characteristics as independent variables
- 6th meeting: Introduction to latent variable modelling in general and latent profile analysis (LPA) and latent class analysis (LCA) specifically ·
- 7th and 8th meeting: Running LPA and LPCA in R
- 9th/10th meeting: Q&A
Study Characteristics
- Discipline: Quantitative Methods
- Language: English
- ECTS: 3
- Type of education: in class
- Academic skill: research
- Graduate School: Graduate School of Social Sciences
- Start date: 14 May 2024
- End date: 13 June
- Min. number of students: 3
- Max. number of students: 10
- Admission criteria: Basic statistical literacy (i.e., familiarity with at least one statistical software such as SPSS, Stata or R; familiarity with basic statistical analysis techniques, e.g., regression, anova, etc.)
- Concluding assessment: yes
- Assessment type: Take home assignment; students can choose one of the following three assignments:
- • Run a k-means cluster analysis, interpret and report results
- • Run a hierarchical cluster analysis, interpret and report results
- • Run LPA, interpret and report results
Each assignment should also include some robustness checks and mention the limitations of the method used
- With certificate: yes
- Roster/schedule info:
10 sessions, 15.00-18.00
Tuesday 14 May
Thursday 16 May
Tuesday 21 May
Thursday 23 May
Tuesday 28 May
Tuesday 4 June
Tuesday 11 June
Thursday 13 June
Tuesday 18 June
Thursday 20 June
- Registration deadline: 16 April 2024
- FSS PhD candidates and other PhD’s in the Social Sciences. No fee for PhDs from VU, ZU, AISSR PhD candidates. Fee for others € 450,-
- Name of teacher: Dr. Paulina Pankowska (p.k.p.pankowska@uu.nl), see personal UU page for more information.
-
FSS – Statistics: Finding Typologies in Data (An Introduction to Cluster Analysis and Latent Variable Modelling)
Course description
Course objectives:
- Learn how to conduct exploratory data analysis and find typologies in the data using cluster analysis and latent class modelling.
- Learn how to use obtained typologies in further analysis (e.g., to establish associations between individual- characteristics and cluster membership).
- Focus on cluster analysis in SPSS and latent variable modelling (specifically latent profile analysis – LPA) in R.
Course content:
The course will consist of 10 meetings.
An overview of the topics covered in each meeting is provided below:
- (1st meeting: Introduction to R, if needed)
- 1st meeting: An introduction to cluster analysis (incl. general description, applications, and distinction between different clustering algorithms – i.e. deterministic vs. probabilistic and connectivity-based, centroid-based, distribution-based vs. density-based).
- 2nd meeting: Running k-means clustering (using the PhD’s own data or a dataset available online e.g., World Values Survey)
- 3rd meeting: Running hierarchical clustering (again, using the PhD’s own data or a dataset available online)
- 4th meeting: Running robustness checks and sensitivity analyses for k-means and hierarchical clustering results; addressing issues and errors
- 5th meeting: Running a multinomial logistic regression analysis using cluster memberships as dependent variable and a set of (individual-level) characteristics as independent variables
- 6th meeting: Introduction to latent variable modelling in general and latent profile analysis (LPA) and latent class analysis (LCA) specifically ·
- 7th and 8th meeting: Running LPA and LPCA in R
- 9th/10th meeting: Q&A
Study Characteristics
- Discipline: Quantitative Methods
- Language: English
- ECTS: 3
- Type of education: in class
- Academic skill: research
- Graduate School: Graduate School of Social Sciences
- Start date: 14 May 2024
- End date: 13 June
- Min. number of students: 3
- Max. number of students: 10
- Admission criteria: Basic statistical literacy (i.e., familiarity with at least one statistical software such as SPSS, Stata or R; familiarity with basic statistical analysis techniques, e.g., regression, anova, etc.)
- Concluding assessment: yes
- Assessment type: Take home assignment; students can choose one of the following three assignments:
- • Run a k-means cluster analysis, interpret and report results
- • Run a hierarchical cluster analysis, interpret and report results
- • Run LPA, interpret and report results
Each assignment should also include some robustness checks and mention the limitations of the method used
- With certificate: yes
- Roster/schedule info:
10 sessions, 15.00-18.00
Tuesday 14 May
Thursday 16 May
Tuesday 21 May
Thursday 23 May
Tuesday 28 May
Tuesday 4 June
Tuesday 11 June
Thursday 13 June
Tuesday 18 June
Thursday 20 June
- Registration deadline: 16 April 2024
- FSS PhD candidates and other PhD’s in the Social Sciences. No fee for PhDs from VU, ZU, AISSR PhD candidates. Fee for others € 450,-
- Name of teacher: Dr. Paulina Pankowska (p.k.p.pankowska@uu.nl), see personal UU page for more information.