Fit_Normal_2P(failures=None, right_censored=None, show_probability_plot=True, print_results=True, CI=0.95, quantiles=None, optimizer=None, CI_type='time', method='MLE', force_sigma=None, downsample_scatterplot=True, **kwargs)¶
Fits a two parameter Normal distribution (mu,sigma) to the data provided. Note that it will return a fit that may be partially in the negative domain (x<0). If you need an entirely positive distribution that is similar to Normal then consider using Weibull.
- failures (array, list) – The failure data. Must have at least 2 elements if force_sigma is not specified or at least 1 element if force_sigma is specified.
- right_censored (array, list, optional) – The right censored data. Optional input. Default = None.
- show_probability_plot (bool, optional) – True or False. Default = True
- print_results (bool, optional) – Prints a dataframe of the point estimate, standard error, Lower CI and Upper CI for each parameter. True or False. Default = True
- method (str, optional) – The method used to fit the distribution. Must be either ‘MLE’ (maximum likelihood estimation), ‘LS’ (least squares estimation), ‘RRX’ (Rank regression on X), or ‘RRY’ (Rank regression on Y). LS will perform both RRX and RRY and return the better one. Default is ‘MLE’.
- optimizer (str, optional) – The optimization algorithm used to find the solution. Must be either ‘TNC’, ‘L-BFGS-B’, ‘nelder-mead’, or ‘powell’. Specifying the optimizer will result in that optimizer being used. To use all of these specify ‘best’ and the best result will be returned. The default behaviour is to try each optimizer in order (‘TNC’, ‘L-BFGS-B’, ‘nelder-mead’, and ‘powell’) and stop once one of the optimizers finds a solution. If the optimizer fails, the initial guess will be returned. For more detail see the documentation.
- CI (float, optional) – confidence interval for estimating confidence limits on parameters. Must be between 0 and 1. Default is 0.95 for 95% CI.
- CI_type (str, optional) – This is the confidence bounds on time or reliability shown on the plot. Use ‘none’ to turn off the confidence intervals. Must be either ‘time’, ‘reliability’, or ‘none’. Default is ‘time’. Some flexibility in names is allowed (eg. ‘t’, ‘time’, ‘r’, ‘rel’, ‘reliability’ are all valid).
- force_sigma (float, int, optional) – Used to specify the beta value if you need to force sigma to be a certain value. Used in ALT probability plotting. Optional input. If specified it must be > 0.
- quantiles (bool, str, list, array, None, optional) – quantiles (y-values) to produce a table of quantiles failed with lower, point, and upper estimates. Default is None which results in no output. To use default array [0.01, 0.05, 0.1,…, 0.95, 0.99] set quantiles as either ‘auto’, True, ‘default’, ‘on’. If an array or list is specified then it will be used instead of the default array. Any array or list specified must contain values between 0 and 1.
- downsample_scatterplot (bool, int, optional) – If True or None, and there are over 1000 points, then the scatterplot will be downsampled by a factor. The default downsample factor will seek to produce between 500 and 1000 points. If a number is specified, it will be used as the downsample factor. Default is True. This functionality makes plotting faster when there are very large numbers of points. It only affects the scatterplot not the calculations.
- kwargs – Plotting keywords that are passed directly to matplotlib for the probability plot (e.g. color, label, linestyle)
- mu (float) – the fitted Normal_2P mu parameter
- sigma (float) – the fitted Normal_2P sigma parameter
- mu_SE (float) – the standard error (sqrt(variance)) of the parameter
- sigma_SE (float) – the standard error (sqrt(variance)) of the parameter
- Cov_mu_sigma (float) – the covariance between the parameters
- mu_upper (float) – the upper CI estimate of the parameter
- mu_lower (float) – the lower CI estimate of the parameter
- sigma_upper (float) – the upper CI estimate of the parameter
- sigma_lower (float) – the lower CI estimate of the parameter
- loglik (float) – Log Likelihood (as used in Minitab and Reliasoft)
- loglik2 (float) – LogLikelihood*-2 (as used in JMP Pro)
- AICc (float) – Akaike Information Criterion
- BIC (float) – Bayesian Information Criterion
- AD (float) – the Anderson Darling (corrected) statistic (as reported by Minitab)
- distribution (object) – a Normal_Distribution object with the parameters of the fitted distribution
- results (dataframe) – a pandas dataframe of the results (point estimate, standard error, lower CI and upper CI for each parameter)
- goodness_of_fit (dataframe) – a pandas dataframe of the goodness of fit values (Log-likelihood, AICc, BIC, AD).
- quantiles (dataframe) – a pandas dataframe of the quantiles with bounds on time. This is only produced if quantiles is not None. Since quantiles defaults to None, this output is not normally produced.
- probability_plot (object) – the axes handle for the probability plot. This is only returned if show_probability_plot = True
If the fitting process encounters a problem a warning will be printed. This may be caused by the chosen distribution being a very poor fit to the data or the data being heavily censored. If a warning is printed, consider trying a different optimizer.
LL(params, T_f, T_rc)¶
LL_fs(params, T_f, T_rc, force_sigma)¶
logR(t, mu, sigma)¶
logf(t, mu, sigma)¶