Doubt in activity 3, week 2

60 views
Skip to first unread message

hemangi...@gmail.com

unread,
Oct 16, 2020, 10:54:03 AM10/16/20
to Discussion forum for Statistics for Data Science I
World Health Organisation (WHO) conducted a study to examine how many human lives have been lost during Covid-19 pandemic. It gathered the data across all the countries and represented it graphically. WHO also wants to compare it with another data which shows the percentage of total population of each country affected by it. Which chart will be more suitable for this dataset?


Why in the above stated question, pareto chart will be used?? please explain


Regards,
Hemangini

Anand Iyer

unread,
Oct 17, 2020, 2:45:14 AM10/17/20
to Discussion forum for Statistics for Data Science I, hemangi...@gmail.com
Pareto is used when you wish to see the data points in a descending (ascending) sorted order, so that it's easy to point out the high-impact values.

In this case, when you graph the lost lives/population per country, you will want to see which country was most impacted by the Covid, and also answer the question if lost lives are more on countries with a higher population, or derive a suitable relation between the variables.  So, using a Pareto is suggested.

Anamika Verma

unread,
Oct 17, 2020, 4:07:55 AM10/17/20
to Discussion forum for Statistics for Data Science I, anandd...@gmail.com, hemangi...@gmail.com
Hi there,

I understand the explanation. But since the country count across the world is over 150, and representing that data as Pareto would lead to a very cluttered bar graph (Also, it is highly unlikely that multiple countries have the same COVID count, which can be presented in 'Others'). So, would it be better if we present this data in the table format? And, from response per se, it is 'None of the Above'.

Statistics 1 Support 1

unread,
Oct 17, 2020, 2:19:18 PM10/17/20
to Discussion forum for Statistics for Data Science I, Anamika Verma, anandd...@gmail.com, hemangi...@gmail.com
Hello, Anamika,

We can combine all the categories with lower frequencies into the 'Others' category, thereby we can reduce the countries of 150 to maybe 15. Therefore Pareto chart would be a correct option if we want to represent cases in descending order.

Best,
Ram,
Statistics-1 Course Instructor
Reply all
Reply to author
Forward
0 new messages