If you want to influence, help, change, make, affect, increase, decrease, prevent, produce, or trigger some action, you are interested in causing an effect. For example, if you want to help people, you need to use treatments that cause good effects and that prevent bad effects. So, if you were giving someone a treatment that had been researched, you would prefer that the study suggesting that the treatment was effective was an internally valid study.
1. The alleged cause and the alleged effect are correlated: Changes in the causal variable are associated with changes in the effectvariable. (If the variables are not related, they can not be causally related.)
2. Changes in the alleged cause come before changes in the alleged effect (If changes in what you are calling the cause come after changes in what you call the effect, what you think is the cause may really be the effect--;and what you think is the effect may really be the cause. Put another way, what you think is a cause may only be a symptom).
3. No other factor could account for the relationship between the alleged cause and the alleged effect (If you cannot rule out the many possible third variables--;also known as;lurking variables--;both of your variables may be symptoms/side effects of some other variable. For example, ice cream consumption is correlated with shark attacks, but both may be effects of warmer weather.
Category #1--;Problems due to comparing one group of participants against a different group or subgroup of participants:
Category #2-- Factors other than the treatment that may change participants:
Category #3--Factors that may cause scores to change even though participants have not changed:
History refers to changes that occur outside the participant (in the participant's environment); maturation refers to changes inside the participant.
Testing and instrumentation both refer to participants' scores changing from one measurement to the next for factors unrelated to the treatment.
Instrumentation refers to changes in scores being due to changes in the measuring instrument or in how participants are scored. For example, if after the pretest, the questionnaire is revised, the scoring system is refined, or raters are trained, changes in posttest scores may reflect changes in the instrument rather than changes in the participant.
Testing, on the other hand, refers to changes in the participant due to the experience of being measured. A simple example of testing would be to take a trivia test one day and then take the same test the next day. If you researched the questions after taking the test the first time, you would do better the second time. In this case, the act of taking the first test changed you. In this class, you try to take advantage of the testing effect by taking the online practice quizzes--and by studying these review questions.
In short, the difference is that in testing, participants really have changed--taking the first test caused them to learn something which caused participants to behave and score differently on the retest. In instrumentation, on the other hand, the participants haven't changed--;their scores change only because the measuring instrument has changed.
Having different groups get different treatments avoids threats like history (other than the treatment, the different groups should experience the same outside events), maturation (both groups have the same time to mature), instrumentation (participants are often only measured once by the same instrument), and testing (both groups are usually measured the same number of times--once). .
Although mortality can be a problem, the biggest problem is usually selection: Because people are different, comparing your two groups may be like comparing apples with oranges. That is, they may have been different before the treatment was introduced.
You are not making groups equal. Instead, you are making sure the groups differ in at least one way. They probably differ in other ways as well.
The groups will probably not differ in terms of History, Maturation, Instrumentation, and Testing.
By comparing each participant with herself, you are comparing apples with apples, so you do not have to worry about either Selection or Selection by Maturation
History, Maturation, Instrumentation, Testing, Regression, and Mortality
When you are selecting participants based on their extreme scores and your measure is unreliable.
When you had matched on pretest scores, but (a) there was time for participants to change from pretest to posttest and (b) the groups differed in ways that might affect maturation (e.g., they differed in terms of age or gender).
When no participants withdraw or are withdrawn from the study.
In trying to keep nontreatment factors constant, the researcher may lose the ability to generalize to situations or participants that differ from the narrow range of situations and participants studied. For example, to help internal validity, a researcher might study participants who do not vary much from each other (white, male mice who are 180 days old or pairs of identical twins) under tightly controlled laboratory conditions. Would the results apply to other, more diverse populations? To less tightly controlled real world settings?
By using random assignment, you can do an internally valid study.
For practice identifying internal validity problems with designs that do not use random assignment, click here.