Many animals, including humans, tend to live in groups, herds, flocks, bands, packs, shoals, or colonies of conspecific individuals. The size of these groups, as expressed by the number of people/etc in a group such as eight groups of nine people in each one, is an important aspect of their social environment. Group size tend to be highly variable even within the same species, thus we often need statistical measures to quantify group size and statistical tests to compare these measures between two or more samples. Group size measures are notoriously hard to handle statistically since groups sizes typically follow an aggregated distribution: most groups are small, few are large, and a very few are very large. Statistical measures of group size roughly fall into two categories.
Outsiders' view of group size
Group size is the number of individuals within a group;
Mean group size, the arithmetic mean of group sizes averaged over groups;
Median group size, the median of group sizes calculated over groups;
Confidence interval for median group size.
Insiders' view of group size
As Jarman pointed out, average individuals live in groups larger than average. Therefore, when we wish to characterize a typical individual’s social environment, we should apply non-parametric estimations of group size. Reiczigel et al. proposed the following measures:
Crowding is the size of a group that a particular individual lives in. Practically, it describes the social environment of one particular individual. This was called Individual Group Size in Jovani & Mavor's paper.;
Mean crowding, i.e. the arithmetic mean of crowding measures averaged over individuals ;
Confidence interval for mean crowding.
Example
Imagine a sample with three groups, where group sizes are one, two, and six individuals, respectively, then Generally speaking, given there are G groups with sizes n1, n2,..., nG, mean crowding can be calculated as:
Statistical methods
Due to the aggregated distribution of group members among groups, the application of parametric statistics would be misleading. Another problem arises when analyzing crowding values. Crowding data consist of non-independent values, or ties, which show multiple and simultaneous changes due to a single biological event. Reiczigel et al. discuss the statistical problems associated with group size measures and offer a free statistical toolset.
Literature
Debout G 2003. Le corbeau freux nicheur en Normandie: recensement 1999 & 2000. Cormoran,13, 115-121.