
Pilot study & reliability test
2013.04 Jue Wang

1. Pilot experiment

A pilot experiment, also called a pilot study, is a small-scale preliminary study conducted to evaluate feasibility, time, cost, adverse events, and effect size, in order to predict an appropriate sample size and improve the study design before a full-scale research project is carried out.

Implementation of pilot studies

Pilot experiments are frequently carried out before large-scale quantitative research, to avoid wasting time and money on an inadequately designed project. A pilot study is usually carried out on members of the relevant population, but not on those who will form part of the final sample, because prior involvement in the research may influence the later behavior of research subjects.

A pilot study is often used to test the design of the
full-scale experiment, which can then be adjusted. It can yield valuable insight: anything found to be missing in the pilot study can be added to the full-scale experiment to improve the chances of a clear outcome.

Applications

In engineering applications, pilot experiments are often used to sell a product and to provide quantitative proof that the system has the potential to succeed on a full-scale basis. Pilot experiments are also used to reduce cost, as they are less expensive than full experiments. If there is not yet enough reason to commit to a full-scale application, a pilot study can generally provide this proof.

In sociology, pilot studies can be referred to as small-scale studies that help identify design issues before the main research is done.

2. Reliability

In psychometrics, reliability describes the overall consistency of a measure. A measure is said to have high reliability if it produces similar results under consistent conditions. For example, measurements of people's height and weight are often extremely reliable.

Will people answer the same question in the same way on different occasions?

Difference from validity

Reliability does not imply validity. That is, a reliable measure that is measuring something consistently may not be measuring what you want it to measure.

While reliability does not imply validity, a lack of reliability does place a limit on the overall validity of a test.

A test that is not perfectly reliable cannot be perfectly valid. While a reliable test may provide useful valid information, a test that is not reliable cannot possibly be valid.

An example often used to illustrate the difference between reliability and validity in the experimental sciences involves a common bathroom scale. If someone who weighs 200 pounds steps on a scale 10 times and gets readings of 15, 250, 95, 140, etc., the scale is not reliable. If the scale consistently reads 150, then it is reliable, but not valid. If it reads 200 each time, then the measurement is both reliable and valid.

General model

In practice, testing measures are never perfectly consistent. Theories of test reliability have been developed to estimate the effects of inconsistency on the accuracy of measurement. The basic starting point for almost all theories of test reliability is
the idea that test scores reflect the influence of two sorts of factors:

1. Factors that contribute to consistency: stable characteristics of the individual or the attribute that one is trying to measure
2. Factors that contribute to inconsistency: features of the individual or the situation that can affect test scores but have nothing to do with the attribute being measured

Some of these sources of inconsistency include:

• Temporary but general characteristics of the individual: health, fatigue, motivation, emotional strain
• Temporary and specific characteristics of the individual: comprehension of the specific test task, specific tricks or techniques for dealing with the particular test materials, fluctuations of memory, attention or accuracy
• Aspects of the testing situation: freedom from distractions, clarity of instructions, interaction of personality, sex, or race of examiner
• Chance factors: luck in selection of answers by sheer guessing, momentary distractions

The goal of estimating reliability is to determine how much of the variability in test scores is due to errors in measurement and how much is due to variability in true scores.

True score

A true score is the replicable feature of the concept being measured. It is the part of the observed score that would recur across different measurement occasions in the absence of error.

Errors of measurement are composed of both random error and systematic error. They represent the discrepancies between scores obtained on tests
and the corresponding true scores.

This conceptual breakdown is typically represented by the simple equation:

$X = T + E$

where $X$ is the observed score, $T$ the true score, and $E$ the error of measurement.

Classical test theory

Reliability theory shows that the variance of obtained scores is simply the sum of the variance of true scores plus the variance of errors of measurement:

$\sigma_X^2 = \sigma_T^2 + \sigma_E^2$

This equation suggests that test scores vary as the result of two factors:

1. Variability in true scores
2. Variability due to errors of measurement

The reliability coefficient provides an index of the relative influence of true and error scores on attained test scores. In its
general form, the reliability coefficient is defined as the ratio of true score variance to the total variance of test scores or, equivalently, one minus the ratio of the error score variance to the observed score variance:

$\rho_{xx'} = \dfrac{\sigma_T^2}{\sigma_X^2} = 1 - \dfrac{\sigma_E^2}{\sigma_X^2}$

Unfortunately, there is no way to directly observe or calculate the true score, so a variety of methods are used to estimate the reliability of a test.

Estimation

Four practical strategies have been developed that provide workable methods of estimating test reliability:

• Test-retest reliability method
• Parallel-forms method
• Split-half method
• Internal consistency

Cronbach's alpha

In statistics, Cronbach's $\alpha$ (alpha) is a coefficient of reliability. It is commonly used as a measure of the internal consistency or reliability of a psychometric test score for a sample of examinees. It was first named alpha by Lee Cronbach in 1951, as he had intended to continue with further coefficients.

Cronbach's alpha statistic is wid
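
As a rough illustration of the internal-consistency approach, the sketch below computes Cronbach's alpha for a small score table. It assumes the commonly cited formula $\alpha = \frac{k}{k-1}\left(1 - \frac{\sum_i \sigma_i^2}{\sigma_{total}^2}\right)$, which is not shown in this excerpt, where $k$ is the number of items, $\sigma_i^2$ the sample variance of item $i$, and $\sigma_{total}^2$ the sample variance of the respondents' total scores. The function name `cronbach_alpha` and the `pilot` data are hypothetical and are not taken from the slides.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Estimate Cronbach's alpha from a table of item scores.

    scores: list of rows, one row per respondent, one column per item.
    Uses sample variances (n - 1 in the denominator) throughout.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    if len(scores) < 2:
        raise ValueError("need at least two respondents")

    # Transpose rows into columns so each tuple holds one item's scores.
    items = list(zip(*scores))
    item_var_sum = sum(variance(col) for col in items)

    # Variance of each respondent's total score across all items.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")

    return (k / (k - 1)) * (1 - item_var_sum / total_var)


# Hypothetical pilot data: 5 respondents answering 3 items on a 1-5 scale.
pilot = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
    [3, 3, 4],
]

alpha = cronbach_alpha(pilot)
print(f"Cronbach's alpha: {alpha:.3f}")
```

In a pilot study, a calculation along these lines could be run on the pilot questionnaire responses before the full-scale survey. Which alpha value counts as acceptable is a judgment this sketch does not make.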
