Talk:Statistical population
This level-4 vital article is rated Start-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | |||||||||||
|
Untitled
[edit]Is not a statistical population that from which a statistical sample is drawn, and about which statistical inferences are made about statistical parameters?
- Yes. The article is not now very well written; maybe I'll work on it at some point. Michael Hardy 18:43, 24 Dec 2003 (UTC)
Yes, I have (also?) had trouble understanding this article. At the time of writing it defines a "Statistical Population" in terms of a mathematical function applied to a "Population". Surely if the definition appears to be defined in terms of itself then this should be expained (i.e. "Population" is defined in terms of "Population"), (i.e. self-referentially), (which I think is/should be generally recognized as a bad principle for encyclopedia articles).
Surely what we are after here is a simple logical explanation of what the term "Statistical Population" means to ordinary people with some special interest, with minimal reference to existing specialized conventions.
I think an encyclopedia deserves to have the real meaning explained, not a (possibly shallow) reiteration of existing specialized and formalized definitions put into words.
I think the aim here is conceptual illustration, not rewordings of conventional formal definitions, exhibited without the original needs, understanding and explanation.
I also think Wikipedia could outline some policy regarding the preferable avoidance of re-worded versions of specialized conventional formal definitions, instead recommending tutorial explanation of abstract concepts in terms of more familiar down-to-earth concepts. Otherwise people need to do the three year university course to understand the article. (and possible the doctorate!)
Suggestion for an example-based introduction
[edit](I did not want to add this, since I am not even a math student, and I could be wrong on some points. But I believe that this approach will help many, many more readers understand the concept. I hope someone with more statistics knowledge could read it and consider adding it, instead of the current crows example. Note that my example also contains crows ;) )
West Hill Highschool has 422 students. The principle wants to know the average grade of the students. Fortunately, all the grades are written down in a protocol, and he can easily calculate the average. In this case, the set of all students is the population.
He also wants to know what the average weight is for the students, but this is not written down anywhere. He could ask them all to show up for weighing, but it would be a timeconsuming process. Instead he takes a sample of the students (population), i.e. a subset. With a sample of 50 students, he is able to do a reasonable estimate of the average weight of the all students (population).
In the highschool example, he knows exactly how many students there are, and if worth the trouble he could actually examine each entity in the population. But in other cases, the population can be of unknown size. E.g. the number of crows in a county. And even if the size of the population is known, it can be impossible to collect data for every entity.
Knowing the size of a population does not allow us to make better estimates, but it allows us to better know how much to trust in our estimates. Statistics is not simply about giving estimates, but about calculating the probability of different prognoses being true. Say we have made an estimate of the average weight of the crows, by weighing a sample of 25 and the average weight was 1000g. We might then estimate the average weight of the population to be 1000g, and claim, that the average weight of the whole population must be between 900g and 1100g. If there is a total of 30 crows in the county, that gives our claim a high probability of being true. However, if there are 50.000 crows in the county, the probability of our claim being true is smaller.
There are also populations that are infinite. Say we want to measure the mean temperature over a year. The population is points in time in that year. We could do measurements, every day, every hour, every minute, etc. But even if we make a measurement a million times per second, we still have not covered all points in time.
The terms sample and population make most sense in relation to eachother.
213.165.179.229 (talk) 19:54, 17 July 2011 (UTC)
German population is ambiguous: if by that is meant the people with German citizenship, then all do not have the same "genetic heritage" for one thing. — Preceding unsigned comment added by 2A01:E35:8AD5:C150:1935:D725:D0F5:9C34 (talk) 22:38, 6 August 2014 (UTC)
I would like to add a link to the "Sampling fraction" article by adding the sentence "The ratio of the size of this statistical sample to the size of the population is called a sampling fraction" after the sentence "In statistical inference, a subset of the population (a statistical sample) is chosen to represent the population in a statistical analysis." Does anyone have any comments or suggestions?Vs2357 (talk) 04:10, 3 April 2018 (UTC)
There are too many links now.
[edit]The request to add links has been overdone to the point where there are so many blue words highlighted that it distracts from the original content of the article itself. Many of the links have nothing to do with the understanding of the article. For example: Why do you provide a link to "Egyptian males" when we are talking about mathematical statistics, bell curves, and sub-samples of statistical data? We are NOT talking explicitly about human populations. We are talking about populations (meaning "sets") of data. Data collection. First of all, math students reading the article probably know that an Egyptian male is a man from the country of Egypt. You don't need to explain that to them. Looking up Egyptian males will do nothing to enhance their comprehension of the article. Links that don't help the user to comprehend the article better should be removed. They just burden the reader with ridiculous distractions. Who is in charge of removing links that are not helpful to the article? Can I do it? Do I need to get permission from someone first? Just Peg (talk) 17:06, 30 August 2024 (UTC)
- Same here. As a statistics student I clicked on the Wikipedia suggestion to add more links, and then finding out there are too many! I think it is completely ok if you remove them. Never thought I could be interested in sand while reading such an important statistical concept ahaha Rob Formi (talk) 10:02, 31 August 2024 (UTC)
- Just Peg, good point. Well, this is Wikipedia and no one is in charge (except for User:Ritchie333, who really runs the joint). I actually was checking the talk page because I noticed the same thing. So have at it. We have rules/guidelines, and WP:OVERLINK explains them. Thanks, Drmies (talk) 23:49, 4 September 2024 (UTC)