Statistical analysis (confidence intervals, central limit theorem, distribution modeling, hypothesis testing) of Spotify All Time Top 2000 Songs list.
Overview
Important statistical concepts and tools are explored using the Spotify All Time Top 2000 Songs dataset. The statistical concepts explored are: visualization plots (histogram, cumulative relative frequency plot, normal Q-Q plot, scatter plot, boxplot), point estimates (sample mean, median, mode, variance, standard deviation), confidence interval for the population mean, theoretical distribution fitting, central limit theorem, and hypothesis testing.
Technical Details:
Tech Stack: R (statsr, dplyr, ggplot2, fitdistrplus)
Results
Please review the markdown for conclusions drawn from the statistical analysis in R: Markdown
Want to connect?
Connect with me through LinkedIn, or reach out to me via email or phone number.