版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進(jìn)行舉報或認(rèn)領(lǐng)
文檔簡介
1、Pilot study & reliability test 2013.04 Jue Wang 1.Pilot experiment nA pilot experiment, also called a pilot study, is a small scale preliminary study conducted in order to evaluate feasibility, time, cost, adverse events, and effect size in an attempt to predict an appropriate sample size and improv
2、e upon the study design prior to performance of a full-scale research project. Implementation of pilot studies nPilot experiments are frequently carried out before large- scale quantitative research, in an attempt to avoid time and money being wasted on an inadequately designed project. A pilot stud
3、y is usually carried out on members of the relevant population, but not on those who will form part of the final sample. This is because it may influence the later behavior of research subjects if they have already been involved in the research. nA pilot study is often used to test the design of the
4、 full- scale experiment which then can be adjusted. It is a potentially valuable insight and should anything be missing in the pilot study it can be added to the full-scale experiment to improve the chances of a clear outcome. Applications nOften in engineering applications, pilot experiments are us
5、ed to sell a product and provide quantitative proof that the system has potential to succeed on a full scale basis. Pilot experiments are also used to reduce cost, as they are less expensive than full experiments. If there is not enough reason to provide full scale applications, pilot studies can ge
6、nerally provide this proof. Applications nIn sociology, pilot studies can be referred to as small-scale studies that will help identify design issues before the main research is done. 2. Reliability nIn the psychometrics, reliability is used to describe the overall consistency of a measure. A measur
7、e is said to have a high reliability if it produces similar results under consistent conditions. For example, measurements of peoples height and weight are often extremely reliable. Reliability nWill people answer the same question in the same way on different occasion? Difference from validity nRel
8、iability does not imply validity. That is, a reliable measure that is measuring something consistently, may not be measuring what you want to be measuring. Difference from validity nWhile reliability does not imply validity, a lack of reliability does place a limit on the overall validity of a test.
9、 A test that is not perfectly reliable cannot be perfectly valid. While a reliable test may provide useful valid information, a test that is not reliable cannot possibly be valid. Difference from validity nAn example often used to illustrate the difference between reliability and validity in the exp
10、erimental sciences involves a common bathroom scale. If someone who is 200 pounds steps on a scale 10 times and gets readings of 15, 250, 95, 140, etc., the scale is not reliable. If the scale consistently reads 150, then it is reliable, but not valid. If it reads 200 each time, then the measurement
11、 is both reliable and valid. General model nIn practice, testing measures are never perfectly consistent. Theories of test reliability have been developed to estimate the effects of inconsistency on the accuracy of measurement. The basic starting point for almost all theories of test reliability is
12、the idea that test scores reflect the influence of two sorts of factors: 1. Factors that contribute to consistency: stable characteristics of the individual or the attribute that one is trying to measure 2. Factors that contribute to inconsistency: features of the individual or the situation that ca
13、n affect test scores but have nothing to do with the attribute being measured General model nSome of these inconsistencies include: Temporary but general characteristics of the individual: health, fatigue, motivation, emotional strain Temporary and specific characteristics of individual: comprehensi
14、on of the specific test task, specific tricks or techniques of dealing with the particular test materials, fluctuations of memory, attention or accuracy Aspects of the testing situation: freedom from distractions, clarity of instructions, interaction of personality, sex, or race of examiner Chance f
15、actors: luck in selection of answers by sheer guessing, momentary distractions nThe goal of estimating reliability is to determine how much of the variability in test scores is due to errors in measurement and how much is due to variability in true scores. True score nA true score is the replicable
16、feature of the concept being measured. It is the part of the observed score that would recur across different measurement occasions in the absence of error. nErrors of measurement are composed of both random error and systematic error. It represents the discrepancies between scores obtained on tests
17、 and the corresponding true scores. nThis conceptual breakdown is typically represented by the simple equation: Classical test theory nReliability theory shows that the variance of obtained scores is simply the sum of the variance of true scores plus the variance of errors of measurement. nThis equa
18、tion suggests that test scores vary as the result of two factors: 1. Variability in true scores 2. Variability due to errors of measurement. 222 XTE Classical test theory nThe reliability coefficient provides an index of the relative influence of true and error scores on attained test scores. In its
19、 general form, the reliability coefficient is defined as the ratio of true score variance to the total variance of test scores. Or, equivalently, one minus the ratio of the variation of the error score and the variation of the observed score: nUnfortunately, there is no way to directly observe or ca
20、lculate the true score, so a variety of methods are used to estimate the reliability of a test. 22 22 1 TE xx XX Estimation nThe goal of estimating reliability is to determine how much of the variability in test scores is due to errors in measurement and how much is due to variability in true scores
21、. nFour practical strategies have been developed that provide workable methods of estimating test reliability. Test-retest reliability method Parallel-forms method Split-half method Internal consistency Cronbachs alpha nIn statistics, Cronbachs (alpha) is a coefficient of reliability. It is commonly used as a measure of the internal consistency or reliability of a psychometric test score for a sample of examinees. It was first named alpha by Lee Cronbach in 1951, as he had intended to continue with further coefficients. nCronbachs alpha statistic is wid
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 2025-2030全球3D生物打印植入物行業(yè)調(diào)研及趨勢分析報告
- 2024年軍隊文職人員招聘考試題庫二
- 2025年度旅游產(chǎn)業(yè)轉(zhuǎn)型升級個人咨詢服務(wù)協(xié)議
- 2025版文化產(chǎn)業(yè)投資合作開發(fā)協(xié)議3篇
- 2025版住宅小區(qū)物業(yè)委托維護(hù)管理協(xié)議3篇
- 二零二五年度藝術(shù)場地租賃合同中的藝術(shù)創(chuàng)作與展覽指導(dǎo)2篇
- 二零二五年度阿拉爾經(jīng)濟(jì)技術(shù)開發(fā)區(qū)環(huán)保產(chǎn)業(yè)合作開發(fā)合同3篇
- 2024版影視器材租賃合同下載
- 2025版房地產(chǎn)銷售合同標(biāo)準(zhǔn)模板
- 2024糯玉米采購協(xié)議書
- 印度與阿拉伯的數(shù)學(xué)
- 會陰切開傷口裂開的護(hù)理查房
- 《鋼鐵是怎樣煉成的》選擇題100題(含答案)
- 實驗報告·測定雞蛋殼中碳酸鈣的質(zhì)量分?jǐn)?shù)
- 部編版小學(xué)語文五年級下冊集體備課教材分析主講
- 電氣設(shè)備建筑安裝施工圖集
- 《工程結(jié)構(gòu)抗震設(shè)計》課件 第10章-地下建筑抗震設(shè)計
- 公司法務(wù)部工作細(xì)則(草案)
- 第18課《文言文二則 鐵杵成針》(學(xué)習(xí)任務(wù)單)- 四年級語文下冊部編版
- 《功能材料概論》期末考試試卷及參考答案2023年12月
- 機(jī)器設(shè)備抵押合同
評論
0/150
提交評論