Personal blog of Emilio López Cano, statistician ("data scientist", as they say now). Teaching Statistics and Mathematics at the Universidad de Castilla-La Mancha, and Data Science at the Universidad Rey Juan Carlos. Doing research in projects at the Universidad Rey Juan Carlos. Contributing to standards at AENOR. Training at the Asociación Estadística de la Calidad. I write about these topics.
Thursday, 1 December 2016
Thursday, 24 March 2016
SixSigma R package new release (0.9-3)
I have just released a new version (0.9-3) of the SixSigma R package, which contains the functions and data used in the books Six Sigma with R and Quality Control with R. The main changes in this version concern the function ss.rr, for measurement systems analysis (Chapter 5 of Six Sigma with R). Following feedback from the user Garry Kitchin (thank you very much, Garry), I have added the alphaLim argument, which sets a significance threshold (0.05 by default) for running the Gage R&R analysis without the appraiser/part interaction effect. This is common practice in MSA, and automatic handling of it was certainly needed, as in other well-known software packages. In addition, the default value of the sigma argument is now 6 instead of 5.15, in line with the Six Sigma approach.
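A minimal sketch of a call using the two arguments mentioned above. It assumes the ss.data.rr example data set shipped with the package (with its time1, prototype and operator columns), and the comment on alphaLim is an interpretation of the description above, not a statement of the package internals:

```r
library(SixSigma)

# Example data assumed to ship with the package: flight time (time1)
# measured on several prototypes (parts) by several operators (appraisers).
ss.rr(var = time1, part = prototype, appr = operator,
      data = ss.data.rr,
      alphaLim = 0.05,  # threshold for analysing without the interaction
      sigma = 6)        # new default; pass sigma = 5.15 for the old value
```

Passing sigma = 5.15 explicitly should reproduce the study variation width used by earlier versions.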
Another important change concerns the F test in the ANOVA table of the full model, i.e., the model with interaction. The standard ANOVA table returned by R is for fixed effects, but an R&R study involves repeated measures, so the error term (the denominator of the F statistic) for the main effects should be the interaction mean square. The ANOVA table output has therefore changed to report the appropriate tests for the main effects (appraiser, part). What surprised me while checking the references, mainly the AIAG MSA Manual, one of the main sources for Six Sigma practitioners, is that the example on page 127 uses the repeatability MS as the error term, whereas the appendix uses the interaction MS (which is actually common practice in the rest of the references and in commercial software).
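The difference between the two error terms can be illustrated with base R alone. The sketch below is not the package's code: it uses simulated data (a hypothetical 5 parts x 3 appraisers x 2 runs layout) and recomputes the main-effect F statistics by hand with the interaction mean square as denominator.

```r
# Hypothetical crossed layout: 5 parts x 3 appraisers x 2 runs each.
set.seed(1)
d <- expand.grid(run = 1:2, appr = factor(1:3), part = factor(1:5))
d$y <- 10 + as.numeric(d$part) + rnorm(nrow(d), sd = 0.5)

# Standard fixed-effects table: every F statistic uses the residual
# (repeatability) mean square as its denominator.
tab <- anova(lm(y ~ part * appr, data = d))

ms  <- setNames(tab[["Mean Sq"]], rownames(tab))
dof <- setNames(tab[["Df"]], rownames(tab))

# Main effects re-tested against the part:appr interaction mean square.
main  <- c("part", "appr")
f_int <- ms[main] / ms[["part:appr"]]
p_int <- pf(f_int, dof[main], dof[["part:appr"]], lower.tail = FALSE)

# Side-by-side comparison of the two F statistics.
cbind(F_residual    = tab[main, "F value"],
      F_interaction = f_int,
      p_interaction = p_int)
```

The two columns of F statistics generally differ, which is why the choice of error term matters for the conclusions drawn about parts and appraisers.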
By the way, I reviewed all the function code and added clearer comments, indentation, spacing, etc. A bug in the control limits of the range control chart has also been fixed, along with some minor improvements.
I feel really encouraged to continue developing the package when receiving great feedback from people "working in industry where the grip of heavily commercialised software dominates, and feeling like lone warriors battling against this" ;-)
SixSigma package on GitHub
SixSigma package on CRAN
Updates on sixsigmawithr.com
NOTE: It may take a few days for the latest version to be available on CRAN. If you read this before that happens, you can get the latest version from GitHub by running:
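# Requires the devtools package: install.packages("devtools")
library(devtools)
# Install the development branch of SixSigma from GitHub
install_github(repo = "emilopezcano/SixSigma", ref = "develop")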
Monday, 21 March 2016
On the importance of Statistics for science
Article published on elconfidencial.com: Los investigadores no saben estadística (y eso perjudica a la ciencia) [Researchers don't know statistics (and that harms science)]
I strongly recommend reading this article (and, for those still keen afterwards, some of the scientific papers it refers to). Many of the scientists I know are aware of the importance of statistics; it is something that genuinely concerns them. But when the moment of truth arrives, for whatever reason (lack of resources, of motivation, of support, ...), the reality is that in most cases they apply more or less familiar "recipes" that don't hurt too much. Food for thought.
(image linked from the news article, Corbis)