My analysis of the “Washington Post Police Shooting Report” shows that it contains around 8768 data points with 12 different factors. Because some columns contain null values, the mean and median that I obtained may not be fully representative. I used the .describe() method, and the resulting mean and standard deviation were 37.28 and 12.99, respectively. The 2693 null values were primarily in the following columns: name, age, gender, armed, city, and flee.
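To see how null values affect these summary numbers, here is a minimal sketch in plain Python. The ages below are made up for illustration and are not taken from the dataset. As far as I understand, pandas' .describe() skips missing values by default and reports the sample standard deviation, so the sketch mirrors that behaviour by dropping the None entries first.

```python
from statistics import mean, stdev

# Made-up ages for illustration only; None stands in for a null value.
ages = [34, None, 29, 47, None, 40]

# Keep only the observed (non-null) values before summarising.
observed = [a for a in ages if a is not None]
missing = len(ages) - len(observed)

age_mean = mean(observed)   # average of the non-null values only
age_std = stdev(observed)   # sample standard deviation (n - 1 in the denominator)

print(f"rows: {len(ages)}, missing: {missing}")
print(f"mean: {age_mean:.2f}, std: {age_std:.2f}")
```

The statistics are computed correctly for the rows that have a value, but they describe fewer rows than the full dataset, which is why the count of nulls is worth reporting alongside them.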
Today I also learned about the ANOVA test, which is used instead of the t-test to determine whether there are statistically significant differences between the means of three or more groups. I also discovered that geopy.distance can be used to calculate geodesic distance.
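To make the ANOVA idea concrete, here is a small sketch that computes the one-way ANOVA F statistic by hand. The three groups are made-up numbers, and the function name one_way_anova_f is my own, not something from a library. In practice scipy.stats.f_oneway gives the F statistic together with a p-value; this version only shows where the F value comes from.

```python
from statistics import mean

def one_way_anova_f(*groups):
    """Return the one-way ANOVA F statistic for the given groups."""
    k = len(groups)                               # number of groups
    n_total = sum(len(g) for g in groups)         # total number of observations
    grand_mean = mean(x for g in groups for x in g)

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of each observation around its own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    ms_between = ss_between / (k - 1)             # mean square between groups
    ms_within = ss_within / (n_total - k)         # mean square within groups
    return ms_between / ms_within

# Made-up values for three hypothetical groups.
group_a = [25, 30, 35]
group_b = [35, 40, 45]
group_c = [45, 50, 55]

f_stat = one_way_anova_f(group_a, group_b, group_c)
print(f"F = {f_stat:.2f}")
```

A large F means the group means differ by more than the spread inside each group would suggest. Whether it is statistically significant still depends on the p-value from the F distribution, which this sketch does not compute.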