nannyml.sampling_error.summary_stats module

nannyml.sampling_error.summary_stats.summary_stats_median_sampling_error(sampling_error_components, col) float[source]

Calculate sampling error for Summary Stats Median using reference data.

Using Asymptotic variance formula from https://stats.stackexchange.com/a/61759 https://en.wikipedia.org/wiki/Median#Sampling_distribution

Parameters:
  • sampling_error_components (a set of parameters that were derived from reference data.) –

  • col (the (analysis) column you want to calculate sampling error for.) –

Returns:

sampling_error

Return type:

float

nannyml.sampling_error.summary_stats.summary_stats_median_sampling_error_components(col: Series) Tuple[source]

Calculate sampling error components for Summary Stats Median using reference data.

Parameters:

col (pd.Series) – column for which we are calculating sampling error components

Returns:

(median, pdf(median)

Return type:

Tuple[np.ndarray]

nannyml.sampling_error.summary_stats.summary_stats_std_sampling_error(sampling_error_components, col) float[source]

Calculate sampling error for Summary Stats Standard Deviation using reference data.

Standard Error of Standard Deviation, https://stats.stackexchange.com/a/157305 CR Rao (1973) Linear Statistical Inference and its Applications 2nd Ed, John Wiley & Sons, NY

Parameters:
  • sampling_error_components – a set of parameters that were derived from reference data.

  • col – the (analysis) column you want to calculate sampling error for.

Returns:

sampling_error

Return type:

float

nannyml.sampling_error.summary_stats.summary_stats_std_sampling_error_components(col: Series) Tuple[source]

Calculate sampling error components for Summary Stats Standard Deviation using reference data.

Parameters:

col (pd.Series) – column for which we are calculating sampling error components

Returns:

(std, moment_4th)

Return type:

Tuple[np.ndarray]