1. FAQ: Is it necessary to introduce replicate runs for factorial designs?

-----Original Question-----
Fabrication Engineer
"I have two questions:
- First: I would like to know how to calculate the effect of each factor with no replicates for a general factorial design on 4 factors with 4 levels each. My Design-Expert software somehow determines the significance of each factor without the presence of any replicates in the design of experiment. How can this be? How does it calculate the variance? Should I introduce replicates?
- Second: What is the 'lack of fit'? It appears when I add replicates. If there are no replicates, this parameter disappears from the ANOVA (analysis of variance)."

Answer (from Stat-Ease Consultant Wayne Adams):
"Effects in a factorial design can be estimated without any true replicates because there is hidden replication. The average outcome of a given treatment (level) can always be estimated and the difference between treatment means (or grand mean in the case of categoric factors) can be calculated. This produces an estimated effect and will eventually lead to an estimated coefficient if included in the fitted model.

If all the effects are estimated and included in the model then you are correct, there is not a valid test for significance. What if some of the treatment means were only slightly different? Would this small difference be attributed to a true factor effect or considered random noise in the system? We have a choice when fitting the model. We can choose to say the variation in the treatment means is mostly coming from random noise, or mostly coming from a true effect. If it is determined to be mostly noise (small effects) we pool all the noise effects together (pooled error). This pool of errors is compared to the larger effects. If the larger effects are sufficiently larger than the pooled error (as judged by the F-ratio), then they "test" significant.

Replicates show us variation that we KNOW belongs in the error pool. Nothing changed in the factors, yet the response measurements are somewhat different. This is called "pure error" in Design-Expert. The pure error is literally added to the pool of errors when the effects are tested. Because we know that some of the error comes from replicates we can also test the pooled error from the "too small" effects noted above against the "pure error". This is also a ratio of variances (F-ratio) and generates a p-value. A significant lack-of-fit indicates the pool of errors is larger than the pure error, which is usually interpreted to indicate the wrong model has been fitted.

Take care with the interpretation of lack-of-fit tests. If a process is extremely stable, the replicates will have very little variation. The tiny pure error this produces will create a significant lack-of-fit test. In these cases take a look at the adjusted and predicted R-squared values. If they are high and within 0.2 of each other then the model is representing the data and true response surface well. If there are no replicates, then there is no "pure error" estimate, and therefore no lack-of-fit test.

Do you always need to replicate? No. Do replicates improve the analysis? Yes. Whether or not replicates are needed in a particular experiment depends on many things. For example, if the initial design falls short on power then we advise you add runs. However, if the initial design is a fraction of all combinations, then building a larger fraction is better than replicating the existing one, at least so far as power is concerned. Adding a number of replicated runs (we advise at least 4 — otherwise do not bother to do any) is useful for enabling the lack-of-fit test."
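
To make the pooled-error idea in Wayne's answer concrete, here is a minimal sketch in Python for a small unreplicated two-factor design. The 3x3 response table and the function name are made up for illustration, and treating the AB interaction as noise is an assumption of the sketch. It does not reproduce how Design-Expert selects effects. It only shows that treatment means can be compared against a pool of "too small" effects without any true replicates.

```python
def pooled_error_anova(y):
    """F-ratios for an unreplicated two-factor design.

    y[i][j] is the single response at level i of factor A and level j
    of factor B. The AB interaction is assumed to be noise and is
    pooled into the error term (an assumption of this sketch).
    """
    a, b = len(y), len(y[0])
    grand = sum(sum(row) for row in y) / (a * b)

    # Treatment means for each level of A and of B ("hidden replication")
    mean_a = [sum(row) / b for row in y]
    mean_b = [sum(y[i][j] for i in range(a)) / a for j in range(b)]

    # Sums of squares of the treatment means about the grand mean
    ss_a = b * sum((m - grand) ** 2 for m in mean_a)
    ss_b = a * sum((m - grand) ** 2 for m in mean_b)
    ss_total = sum((v - grand) ** 2 for row in y for v in row)

    # Whatever A and B do not explain goes into the pooled error
    ss_error = ss_total - ss_a - ss_b
    df_a, df_b = a - 1, b - 1
    df_error = df_a * df_b
    ms_error = ss_error / df_error

    return {
        "F_A": (ss_a / df_a) / ms_error,
        "F_B": (ss_b / df_b) / ms_error,
        "SS_error": ss_error,
        "df_error": df_error,
    }


# Hypothetical responses: rows are levels of A, columns are levels of B
response = [
    [10, 11, 12],
    [20, 22, 21],
    [30, 30, 33],
]
result = pooled_error_anova(response)
print(result)
```

In this made-up table factor A moves the response far more than factor B, so its F-ratio against the pooled error is much larger. If the interaction were real rather than noise, the pooled error would be inflated and both F-ratios would be understated.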

This answer by Wayne covers many bases. One that I'd like to reinforce is that when replicates are added to a design, be very wary of their agreement with each other being too good to be true, that is, not really reflecting the true error overall. I've seen cases where every replicate agreed exactly! Obviously someone simply copied over the responses rather than actually replicating the run. Another common mistake is to simply re-sample or re-test the materials made during a given run, which underestimates the error by not going through a complete process re-set. Finally, even replicates done fair and square, re-done from start to finish, might produce results that are too far off for the experimenter to accept, so they get discarded and another run is done that agrees better. That is not fair! - Mark
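
As a rough illustration of why understated replicate variation matters, the sketch below computes a lack-of-fit F-ratio for a straight-line fit to one factor run at four levels with two replicates each. The data, the function names and the one-factor setup are all invented for the example. The same group means are used twice, once with a plausible replicate spread and once with replicates that agree almost perfectly.

```python
from collections import defaultdict


def lack_of_fit(xs, ys):
    """Lack-of-fit F-ratio for a straight-line fit with replicated runs."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n

    # Ordinary least-squares line
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    ss_residual = sum(
        (y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys)
    )

    # Pure error: variation among replicates run at the same setting
    groups = defaultdict(list)
    for x, y in zip(xs, ys):
        groups[x].append(y)
    ss_pure = sum(
        sum((y - sum(g) / len(g)) ** 2 for y in g) for g in groups.values()
    )
    df_pure = n - len(groups)

    # Lack of fit: the residual variation that pure error does not explain
    ss_lof = ss_residual - ss_pure
    df_lof = len(groups) - 2  # two coefficients: intercept and slope

    return {
        "ss_lof": ss_lof,
        "df_lof": df_lof,
        "ss_pure": ss_pure,
        "df_pure": df_pure,
        "f_lof": (ss_lof / df_lof) / (ss_pure / df_pure),
    }


def make_runs(spread):
    """Two replicates per level, placed symmetrically about made-up means."""
    means = {1: 1.0, 2: 2.2, 3: 2.8, 4: 4.0}
    xs, ys = [], []
    for x, m in means.items():
        xs += [x, x]
        ys += [m - spread, m + spread]
    return xs, ys


honest = lack_of_fit(*make_runs(0.5))      # replicates redone from scratch
too_tight = lack_of_fit(*make_runs(0.01))  # e.g. re-tested, not re-run
print(honest["f_lof"], too_tight["f_lof"])
```

The lack-of-fit sum of squares is identical in both cases because the group means are the same. Only the pure error changes, so the F-ratio balloons when the replicates agree too well, even though the fitted line is no worse.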

PS. Version 8 of Stat-Ease software offers a wonderful new tool for general factorials such as this 4x4x4x4: the ability to plot effects on a half-normal plot. This makes it really easy to sort out at a glance which, if any, effects stand out from the rest (the vital few). Simply click these into your provisional model (subject to verification via ANOVA and residual diagnostics) and leave the remainder of effects, those near zero that are lined up (the trivial many, normally distributed) for the error pool. This is really handy for all factorials, but especially ones that are unreplicated.
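
For readers curious about what sits behind a half-normal plot, here is a minimal sketch of how the plotting coordinates can be computed. It assumes the simple case where each effect is a single number, uses one common plotting-position convention, and relies on made-up effect values. How the software handles multi-level effects in a general factorial is not covered here, and the function name is hypothetical.

```python
from statistics import NormalDist


def half_normal_points(effects):
    """Return (name, |effect|, half-normal quantile) sorted by |effect|.

    Uses the plotting position p_i = 0.5 + 0.5 * (i - 0.5) / m, which is
    one common convention and an assumption of this sketch.
    """
    ranked = sorted(effects.items(), key=lambda item: abs(item[1]))
    m = len(ranked)
    std_normal = NormalDist()  # mean 0, standard deviation 1
    points = []
    for i, (name, value) in enumerate(ranked, start=1):
        p = 0.5 + 0.5 * (i - 0.5) / m
        points.append((name, abs(value), std_normal.inv_cdf(p)))
    return points


# Hypothetical effect estimates, invented for illustration
effects = {
    "A": 21.6, "B": 3.1, "C": 9.9, "D": 14.6, "AB": 0.1,
    "AC": -18.1, "AD": 16.6, "BC": 2.4, "BD": -0.4, "CD": -1.1,
}
points = half_normal_points(effects)
for name, size, quantile in points:
    print(f"{name:>2}  |effect| = {size:5.1f}  quantile = {quantile:.3f}")
```

Plotting the quantile against the absolute effect, the small effects tend to fall along a line through the origin while the vital few sit off to the right of it. Those are the ones to consider for the provisional model.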

(Learn more about factorials by attending the two-day computer-intensive workshop "Experiment Design Made Easy." See for a description of this class and link from this page to the course outline and schedule. Then, if you like, enroll online.)


2. Info Alert: Practical considerations for the implementation of design of experiments in quality by design (QbD)

The June issue of Bioprocess International features an article on "Practical Considerations for the Implementation of Design of Experiments in Quality by Design" by Mahesh Shivhare, PhD, statistician, and Graham McCreath, PhD, head of process design, both of Avecia Biologics, Billingham, UK. Here's the abstract: "Quality by Design (QbD) is a systematic approach to bioprocess development of pharmaceutical products involving the use of science-based understanding, quality risk management, and statistically designed experiments. By defining target product quality profiles and applying QbD approaches, product critical quality attributes (CQA) can be identified and manufacturing processes can be developed that maximize control and minimize variability. Statistical design of experiment (DoE) has a major role to play in QbD by allowing the relationships between critical process parameters (CPP), material attributes, and CQAs to be identified and understood. However, to extract the most relevant information from DoE studies and to avoid potential problems of misidentification of important process parameters, a careful consideration of the experimental objectives, design, and interpretation has to be made. The authors present examples of best practice in DoE to avoid such problems that should ultimately contribute to correct identification of a process 'Design Space.'"

The latest version 8 of Design-Expert software, posted* at for free trial evaluation, provides a feature of particular interest to biopharmaceutical experimenters aiming for Quality by Design (QbD): Graphical optimization now frames the design space where all modeled responses fall within confidence, prediction or tolerance intervals (user choice). Check it out!

* This web site also provides free patches to update older licensed versions of 8.0.

(Learn how to apply response surface method (RSM) tools for mapping out design space for QbD by attending the two-day computer-intensive workshop "Designed Experiments for Life Sciences." For a complete course description, see Link from this page to the course outline and schedule. Then, if you like, enroll online.)


