Dear Experimenter,

Here's another set of frequently asked questions (FAQs) about doing design of experiments (DOE), plus alerts to timely information and free software updates.

Here's what I cover in the body text of this DOE FAQ Alert (topics that delve into statistical detail are designated "Expert"):

1. FAQ: Setting up a fractional design with multilevel factors, some of which are categoric
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Original Question
Question: "I am trying to help our process engineers with a DOE. They want to test four factors in the experiment: two categorical and two numeric. One of the numeric factors will be tested at three levels, and everything else is at two levels, so the total number of combinations will be 24 (3 x 2 x 2 x 2). Do you know of a good design that would allow me to meet these requirements and still be around 15 to 18 runs? I wish it was as simple as a standard 2^(4-1) half-fractional two-level factorial. The one factor (numeric) at three levels is causing the problem. Any advice?"
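As a quick check on the arithmetic in this question, here is a minimal Python sketch that enumerates every combination of the four factors. The factor names and coded levels are hypothetical placeholders (the question gives only the number of levels), and the sketch only counts the full factorial; it does not construct the reduced design being asked for.

```python
from itertools import product

# Hypothetical names and coded levels: the question states only how many
# levels each factor has, not what the factors or settings actually are.
factors = {
    "numeric_A": [-1, 0, 1],        # numeric factor tested at three levels
    "numeric_B": [-1, 1],           # numeric factor tested at two levels
    "categoric_C": ["c1", "c2"],    # categoric factor, two levels
    "categoric_D": ["d1", "d2"],    # categoric factor, two levels
}

# Every combination of levels: the full 3 x 2 x 2 x 2 factorial.
full_factorial = list(product(*factors.values()))
n_full = len(full_factorial)

# Run budget mentioned in the question (15 to 18 runs).
target_runs = range(15, 19)

print(f"Full factorial: {n_full} runs")
print(f"Target budget:  {target_runs.start} to {target_runs.stop - 1} runs")
print(f"Runs to trim:   {n_full - (target_runs.stop - 1)} to {n_full - target_runs.start}")
```

Listing the combinations this way makes it plain that the run budget in the question requires leaving out roughly a quarter to a third of the full factorial.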
From: Scotland
"I've been looking at the point prediction tool in Design-Expert software. (I think) I understand that the standard error (SE) of the mean is calculated as the product of the SE design and the standard deviation (Stdev) of the design. I am reasonably happy with what this means and how it is calculated. What I'm struggling to understand is the SE prediction. This seems to be the product of the standard deviation and the square root of one plus the SE design squared, that is, Stdev x sqrt(1 + SEdesign^2). Is this correct? Where does the 1 come from in this term? I realize that what this calculation is trying to do is to take into account a variance component for the model. Why use 1 rather than the SE values for each of the components selected for the model? Any light you can shed on my confusion would be appreciated."

Answer (from Stat-Ease Consultant Pat Whitcomb):
"The confidence interval is expected to contain the mean, or 'true,' value. The prediction interval is constructed to include a single observation. It incorporates uncertainty as to the location of the true value, as well as additional uncertainty associated with any single observation. The 1 added to the SE design squared in the formula accounts for this extra variation. If you want more detail, it can be found in most regression analysis textbooks. However, this is rarely mentioned in DOE textbooks."

Aside from the mathematical details, I would like to add
that the prediction interval can be very helpful for assessing
individual confirmation runs at what are hoped to be optimal
conditions. Naturally, results will vary. The prediction
interval provides an expectation for the amount of variation.

This news is a bit dated, but I just became aware of it thanks to Cliff Yee, President of Northwest Analytical (NWA) in Portland, Oregon. He said, "Good to hear your witty and interesting newsletters. How are you guys doing in response to PAT Guidelines by the FDA? I am wondering if you are seeing more investment by the Life Science industries in DOE consulting, training and software?"

After tracking down the PAT Guidelines at http://www.fda.gov/cder/guidance/6419fnl.doc (Update 3/07: Link no longer available.) (also see http://www.fda.gov/cder/OPS/PAT.htm), I understand why Cliff believes it will generate interest in DOE. The document strongly encourages the use of planned statistical methods to explore how variations in component levels and process conditions will affect pharmaceuticals. If any of you readers can speak to this, please email me.

4. Book Alert: There is a new edition of "Statistics
for Experimenters" by Box, Hunter and Hunter
According to the listing at Amazon (see below), on May 27 Wiley-Interscience published "Statistics for Experimenters: Design, Innovation and Discovery," the second edition of the classic book on DOE by George E. P. Box, J. Stuart Hunter, and the late William G. Hunter. Stat-Ease has a copy on order.

According to the publisher, the "Second Edition is thoroughly revised and updated to reflect the changes in techniques and technologies since the publication of the classic First Edition. Among the new topics included are:
- Graphical analysis of variance"

What intrigues me most is the promise of "An appendix featuring quaquaversal quotes from a variety of sources ranging from noted statisticians and scientists to famous philosophers that embellish key concepts and enliven the learning process." I had to look up the word "quaquaversal." (It blew away my spellchecker!) I will not spoil your fun trying to decipher what it means.
