Part one
1. Suppose you have two datasets with the following summary statistics:
Dataset A: mean = 10, variance = 25
Dataset B: mean = 15, variance = 36
Which dataset has more variability, and why? Show your calculations.
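One way to set up the calculation is sketched below in Python. It derives the standard deviation from each variance and also computes the coefficient of variation (standard deviation divided by the mean). The coefficient of variation is an assumption here: the question does not name it, and whether "variability" means absolute or relative spread should be checked against the Module 2 materials. The names datasets and spread_measures are illustrative only.

```python
import math

# Summary statistics exactly as given in the question.
datasets = {
    "A": {"mean": 10, "variance": 25},
    "B": {"mean": 15, "variance": 36},
}

def spread_measures(mean, variance):
    """Return (standard deviation, coefficient of variation)."""
    sd = math.sqrt(variance)
    # Relative spread; only meaningful when the mean is nonzero.
    cv = sd / mean
    return sd, cv

results = {name: spread_measures(**stats) for name, stats in datasets.items()}

for name, (sd, cv) in results.items():
    print(f"Dataset {name}: standard deviation = {sd:.2f}, CV = {cv:.1%}")
```

Variance and standard deviation measure spread in absolute terms, while the coefficient of variation expresses spread relative to the mean, so the two views can rank the datasets differently.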
Part two
Video: TED Talk – Why You Should Love Statistics
Video: TED Talk – Lies, Damned Lies and Statistics
1. In the TED Talk – Why You Should Love Statistics, list 5 reasons the speaker gives for why you should love statistics. Do you agree with any of these? Why?
2. In the TED Talk – Lies, Damned Lies and Statistics, explain in detail why the speaker says deception is so prevalent in statistics. What would you do to help rectify this issue?
3. In the TED Talk – How Statistics Supports our Intuition, list 5 key points the speaker makes on why statistics does, in fact, support our intuition. Do you agree with these 5 points? Explain.
4. In the TED Talk – The Need for Statistical Literacy, list 5 points the speaker makes on why we do, in fact, need statistical literacy. Do you agree with these points? Explain.
5. Please watch the lecture videos and the TED Talks and complete the readings before attempting this assignment. Remember that all work must be in your own words, with no cutting or pasting from the Internet, and all responses must come from the Module 2 materials, not outside materials.
Complete the assignment in a Word document. To show calculations, you may work on paper, take a picture, and upload the image.
– What is bias and how does it affect the field of statistics?
– What are the Halo Effect and the Horn Effect? How do they affect our interpretation of things?
– What is the difference between a population and a sample?
– What are deceptive statistics and why are they bad/dangerous/misleading?
– What is the difference between descriptive statistics and inferential statistics?
– What are outliers and how are they determined to be outliers?
– What is the difference between variance and standard deviation?
– Note the following Population Set:
P = {2, 5, 7, 4, 11, 15, 23, 34, 5, 19, 1200}
a. List any outliers.
b. Calculate the mean.
c. Calculate the mode.
d. Calculate the median.
e. Calculate the range.
f. Calculate the IQR.
g. Calculate the variance.
h. Calculate the standard deviation.
i. List the 5 Number Summary.
j. Draw a box plot of the values (indicating the outliers, if any).
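The calculations in parts a through i can be checked with a short Python sketch using only the standard library. Two choices below are assumptions, not something the assignment specifies: quartiles use the "exclusive" method of statistics.quantiles (textbooks differ on quartile conventions), and outliers are flagged with the common 1.5 × IQR rule. Because P is a population, the sketch uses pvariance and pstdev, which divide by N. The function name describe_population is illustrative. Drawing the box plot in part j is left out.

```python
import statistics

# Population set P as given in the assignment.
P = [2, 5, 7, 4, 11, 15, 23, 34, 5, 19, 1200]

def describe_population(values):
    """Summarize a full population (variance divides by N, not N - 1)."""
    data = sorted(values)
    # Assumption: "exclusive" quartiles; other conventions can give other values.
    q1, median, q3 = statistics.quantiles(data, n=4, method="exclusive")
    iqr = q3 - q1
    # Assumption: 1.5 * IQR fences define outliers.
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return {
        "mean": statistics.mean(data),
        "mode": statistics.multimode(data),
        "median": median,
        "range": data[-1] - data[0],
        "iqr": iqr,
        "variance": statistics.pvariance(data),
        "std_dev": statistics.pstdev(data),
        "five_number": (data[0], q1, median, q3, data[-1]),
        "outliers": [x for x in data if x < low_fence or x > high_fence],
    }

summary = describe_population(P)
for name, value in summary.items():
    print(f"{name}: {value}")
```

Comparing the mean with the median in the printed summary is a quick way to see how strongly a single extreme value can pull the mean.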
– Note the following Sample Set:
S = {120, 177, 201, 292, 156, 118, 220, 243, 167}
a. List any outliers
b. Calculate the mean.
c. Calculate the mode.
d. Calculate the median.
e. Calculate the range.
f. Calculate the IQR.
g. Calculate the variance.
h. Calculate the standard deviation.
i. List the 5 Number Summary.
j. Draw a box plot of the values (indicating the outliers, if any).
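Because S is a sample rather than a population, its variance and standard deviation are usually computed with n - 1 in the denominator. The sketch below, using Python's standard statistics module, shows that difference and one way to handle the mode when no value repeats. Treating a set in which every value appears once as having no mode is a common convention, but it is an assumption here and should be checked against the course materials.

```python
import statistics

# Sample set S as given in the assignment.
S = [120, 177, 201, 292, 156, 118, 220, 243, 167]

# Sample statistics divide the sum of squared deviations by n - 1.
sample_variance = statistics.variance(S)
sample_std_dev = statistics.stdev(S)

# For comparison only: the population formula divides by n.
population_style_variance = statistics.pvariance(S)

# multimode returns every value tied for the highest count.
# If all distinct values tie, no value stands out as a mode.
modes = statistics.multimode(S)
has_mode = len(modes) < len(set(S))

print(f"sample variance (n - 1): {sample_variance:.2f}")
print(f"sample standard deviation: {sample_std_dev:.2f}")
print(f"population-style variance (n): {population_style_variance:.2f}")
print(f"has a mode: {has_mode}")
```

The remaining parts (median, range, IQR, 5 Number Summary, outliers) follow the same steps as for the population set, since those measures do not depend on the n versus n - 1 distinction.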
– What is the difference between continuous data and discrete data?
– What are the effects of changing units, or having inconsistent units, in statistical analysis?
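A small experiment can make the units question concrete. The Python sketch below uses made-up lengths (hypothetical values, not from the assignment) to show how the mean and variance respond when every value is converted from metres to centimetres, and what happens when only some values are converted.

```python
import statistics

# Hypothetical lengths in metres (illustrative values only).
lengths_m = [2, 4, 4, 5, 7]

# Consistent conversion: every value is multiplied by 100.
lengths_cm = [x * 100 for x in lengths_m]

mean_ratio = statistics.mean(lengths_cm) / statistics.mean(lengths_m)
variance_ratio = statistics.variance(lengths_cm) / statistics.variance(lengths_m)
std_dev_ratio = statistics.stdev(lengths_cm) / statistics.stdev(lengths_m)

# Inconsistent units: two entries recorded in centimetres, the rest in metres.
mixed = [2, 4, 400, 5, 700]

print(f"mean scales by {mean_ratio:.0f}")
print(f"variance scales by {variance_ratio:.0f}")
print(f"standard deviation scales by {std_dev_ratio:.0f}")
print(f"mean of consistent data: {statistics.mean(lengths_m)}")
print(f"mean of mixed-unit data: {statistics.mean(mixed)}")
```

The ratios show that a consistent unit change rescales the statistics in a predictable way, while mixing units within one dataset distorts them in a way that cannot be undone by a single conversion factor.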
Part three
What is meant by the vital phrase “Correlation is not equal to Causation”? Give an example of this.
What are outliers in data? Should you use them or get rid of them? Why or why not? Can outliers tell you anything about the population or sample?
Part four
What are the potential benefits and drawbacks of AI for society and the economy?
Assignment: Exploring the Housing Market in Python
Objective: To gain practical experience in importing and managing data in Python, and to use data visualization techniques to explore trends in the housing market.
Task: