版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡介
..2-.v.?計(jì)算機(jī)網(wǎng)絡(luò)?報(bào)告題目:出處:班級:學(xué)號:XX:SpeechrecognitionHistoryOneofthemostnotabledomainsforthemercialapplicationofspeechrecognitionintheUnitedStateshasbeenhealthcareandinparticulartheworkofthemedicaltranscriptionist.Accordingtoindustryexperts,atitsinception,speechrecognition(SR)wassoldasawaytopletelyeliminatetranscriptionratherthanmakethetranscriptionprocessmoreefficient,henceitwasnotaccepted.ItwasalsothecasethatSRatthattimewasoftentechnicallydeficient.Additionally,tobeusedeffectively,itrequiredchangestothewaysphysiciansworkedanddocumentedclinicalencounters,whichmanyifnotallwerereluctanttodo.Thebiggestlimitationtospeechrecognitionautomatingtranscription,however,isseenasthesoftware.Thenatureofnarrativedictationishighlyinterpretiveandoftenrequiresjudgmentthatmaybeprovidedbyarealhumanbutnotyetbyanautomatedsystem.Anotherlimitationhasbeentheextensiveamountoftimerequiredbytheuserand/orsystemprovidertotrainthesoftware.AdistinctioninASRisoftenmadebetween"artificialsyntaxsystems"whichareusuallydomain-specificand"naturallanguageprocessing"whichisusuallylanguage-specific.Eachofthesetypesofapplicationpresentsitsownparticulargoalsandchallenges.ApplicationsHealthcareInthehealthcaredomain,eveninthewakeofimprovingspeechrecognitiontechnologies,medicaltranscriptionists(MTs)havenotyetbeeobsolete.Manyexpertsinthefieldanticipatethatwithincreaseduseofspeechrecognitiontechnology,theservicesprovidedmayberedistributedratherthanreplaced.Speechrecognitioncanbeimplementedinfront-endorback-endofthemedicaldocumentationprocess.Front-EndSRiswheretheproviderdictatesintoaspeech-recognitionengine,therecognizedwordsaredisplayedrightaftertheyarespoken,andthedictatorisresponsibleforeditingandsigningoffonthedocument.Itnevergoesthroughaneditor.Back-EndSRiswheretheproviderdictatesintoadigitaldictationsystem,andthevoiceisroutedthroughaspeech-recognitionmachineandtherecognizeddraftdocumentisroutedalongwiththeoriginalvoicefiletotheeditor,whoeditsthedraftandfinalizesthereport.DeferredSRisbeingwidelyusedintheindustrycurrently.ManyElectronicMedicalRecords(EMR)applicationscanbemoreeffectiveandmaybeperformedmoreeasilywhendeployedinconjunctionwithaspeech-recognitionengine.Searches,queries,andformfillingmayallbefastertoperformbyvoicethanbyusingakeyboard.MilitaryHigh-performancefighteraircraftSubstantialeffortshavebeendevotedinthelastdecadetothetestandevaluationofspeechrecognitioninfighteraircraft.OfparticularnotearetheU.S.programinspeechrecognitionfortheAdvancedFighterTechnologyIntegration(AFTI)/F-16aircraft(F-16VISTA),theprograminFranceoninstallingspeechrecognitionsystemsonMirageaircraft,andprogramsintheUKdealingwithavarietyofaircraftplatforms.Intheseprograms,speechrecognizershavebeenoperatedsuccessfullyinfighteraircraftwithapplicationsincluding:settingradiofrequencies,mandinganautopilotsystem,settingsteer-pointcoordinatesandweaponsreleaseparameters,andcontrollingflightdisplays.Generally,onlyverylimited,constrainedvocabularieshavebeenusedsuccessfully,andamajorefforthasbeendevotedtointegrationofthespeechrecognizerwiththeavionicssystem.Someimportantconclusionsfromtheworkwereasfollows:Speechrecognitionhasdefinitepotentialforreducingpilotworkload,butthispotentialwasnotrealizedconsistently.Achievementofveryhighrecognitionaccuracy(95%ormore)wasthemostcriticalfactorformakingthespeechrecognitionsystemuseful
—withlowerrecognitionrates,pilotswouldnotusethesystem.Morenaturalvocabularyandgrammar,andshortertrainingtimeswouldbeuseful,butonlyifveryhighrecognitionratescouldbemaintained.Laboratoryresearchinrobustspeechrecognitionformilitaryenvironmentshasproducedpromisingresultswhich,ifextendabletothecockpit,shouldimprovetheutilityofspeechrecognitioninhigh-performanceaircraft.WorkingwithSwedishpilotsflyingintheJAS-39Gripencockpit,Englund(2004)foundrecognitiondeterioratedwithincreasingG-loads.Itwasalsoconcludedthatadaptationgreatlyimprovedtheresultsinallcasesandintroducingmodelsforbreathingwasshowntoimproverecognitionscoressignificantly.Contrarytowhatmightbeexpected,noeffectsofthebrokenEnglishofthespeakerswerefound.Itwasevidentthatspontaneousspeechcausedproblemsfortherecognizer,ascouldbeexpected.Arestrictedvocabulary,andaboveall,apropersyntax,couldthusbeexpectedtoimproverecognitionaccuracysubstantially.TheEurofighterTyphooncurrentlyinservicewiththeUKRAFemploysaspeaker-dependentsystem,i.e.itrequireseachpilottocreateatemplate.Thesystemisnotusedforanysafetycriticalorweaponcriticaltasks,suchasweaponreleaseorloweringoftheundercarriage,butisusedforawiderangeofothercockpitfunctions.Voicemandsareconfirmedbyvisualand/orauralfeedback.Thesystemisseenasamajordesignfeatureinthereductionofpilotworkload,andevenallowsthepilottoassigntargetstohimselfwithtwosimplevoicemandsortoanyofhiswingmenwithonlyfivemands.HelicoptersTheproblemsofachievinghighrecognitionaccuracyunderstressandnoisepertainstronglytothehelicopterenvironmentaswellastothefighterenvironment.Theacousticnoiseproblemisactuallymoresevereinthehelicopterenvironment,notonlybecauseofthehighnoiselevelsbutalsobecausethehelicopterpilotgenerallydoesnotwearafacemask,whichwouldreduceacousticnoiseinthemicrophone.Substantialtestandevaluationprogramshavebeencarriedoutinthepastdecadeinspeechrecognitionsystemsapplicationsinhelicopters,notablybytheU.S.ArmyAvionicsResearchandDevelopmentActivity(AVRADA)andbytheRoyalAerospaceEstablishment(RAE)intheUK.WorkinFrancehasincludedspeechrecognitioninthePumahelicopter.TherehasalsobeenmuchusefulworkinCanada.Resultshavebeenencouraging,andvoiceapplicationshaveincluded:controlofmunicationradios;settingofnavigationsystems;andcontrolofanautomatedtargethandoversystem.Asinfighterapplications,theoverridingissueforvoiceinhelicoptersistheimpactonpiloteffectiveness.EncouragingresultsarereportedfortheAVRADAtests,althoughtheserepresentonlyafeasibilitydemonstrationinatestenvironment.Muchremainstobedonebothinspeechrecognitionandinoverallspeechrecognitiontechnology,inordertoconsistentlyachieveperformanceimprovementsinoperationalsettings.BattlemanagementBattlemanagementmandcentresgenerallyrequirerapidaccesstoandcontroloflarge,rapidlychanginginformationdatabases.mandersandsystemoperatorsneedtoquerythesedatabasesasconvenientlyaspossible,inaneyes-busyenvironmentwheremuchoftheinformationispresentedinadisplayformat.Humanmachineinteractionbyvoicehasthepotentialtobeveryusefulintheseenvironments.Anumberofeffortshavebeenundertakentointerfacemerciallyavailableisolated-wordrecognizersintobattlemanagementenvironments.Inonefeasibilitystudy,speechrecognitionequipmentwastestedinconjunctionwithanintegratedinformationdisplayfornavalbattlemanagementapplications.Userswereveryoptimisticaboutthepotentialofthesystem,althoughcapabilitieswerelimited.SpeechunderstandingprogramssponsoredbytheDefenseAdvancedResearchProjectsAgency(DARPA)intheU.S.hasfocusedonthisproblemofnaturalspeechinterface..Speechrecognitioneffortshavefocusedonadatabaseofcontinuousspeechrecognition(CSR),large-vocabularyspeechwhichisdesignedtoberepresentativeofthenavalresourcemanagementtask.Significantadvancesinthestate-of-the-artinCSRhavebeenachieved,andcurrenteffortsarefocusedonintegratingspeechrecognitionandnaturallanguageprocessingtoallowspokenlanguageinteractionwithanavalresourcemanagementsystem.TrainingairtrafficcontrollersTrainingformilitary(orcivilian)airtrafficcontrollers(ATC)representsanexcellentapplicationforspeechrecognitionsystems.ManyATCtrainingsystemscurrentlyrequireapersontoactasa"pseudo-pilot",engaginginavoicedialogwiththetraineecontroller,whichsimulatesthedialogwhichthecontrollerwouldhavetoconductwithpilotsinarealATCsituation.Speechrecognitionandsynthesistechniquesofferthepotentialtoeliminatetheneedforapersontoactaspseudo-pilot,thusreducingtrainingandsupportpersonnel.Aircontrollertasksarealsocharacterizedbyhighlystructuredspeechastheprimaryoutputofthecontroller,hencereducingthedifficultyofthespeechrecognitiontask.TheU.S.NavalTrainingEquipmentCenterhassponsoredanumberofdevelopmentsofprototypeATCtrainersusingspeechrecognition.Generally,therecognitionaccuracyfallsshortofprovidinggracefulinteractionbetweenthetraineeandthesystem.However,theprototypetrainingsystemshavedemonstratedasignificantpotentialforvoiceinteractioninthesesystems,andinothertrainingapplications.TheU.S.Navyhassponsoredalarge-scaleeffortinATCtrainingsystems,whereamercialspeechrecognitionunitwasintegratedwithaplextrainingsystemincludingdisplaysandscenariocreation.Althoughtherecognizerwasconstrainedinvocabulary,oneofthegoalsofthetrainingprogramswastoteachthecontrollerstospeakinaconstrainedlanguage,usingspecificvocabularyspecificallydesignedfortheATCtask.ResearchinFrancehasfocusedontheapplicationofspeechrecognitioninATCtrainingsystems,directedatissuesbothinspeechrecognitionandinapplicationoftask-domaingrammarconstraints.TheUSAF,USMC,AnotherapproachtoATCsimulationwithspeechrecognitionhasbeencreatedbySupremis.TheSupremissystemisnotconstrainedbyrigidgrammarsimposedbytheunderlyinglimitationsofotherrecognitionstrategies.TelephonyandotherdomainsASRinthefieldoftelephonyisnowmonplaceandinthefieldofputergamingandsimulationisbeingmorewidespread.Despitethehighlevelofintegrationwithwordprocessingingeneralpersonalputing,however,ASRinthefieldofdocumentproductionhasnotseentheexpectedincreasesinuse.Theimprovementofmobileprocessorspeedsmadefeasiblethespeech-enabledSymbianandWindowsMobileSmartphones.Currentspeech-to-textprogramsaretoolargeandrequiretoomuchCPUpowertobepracticalforthePocketPC.SpeechisusedmostlyasapartofUserInterface,forcreatingpre-definedorcustomspeechmands.Leadingsoftwarevendorsinthisfieldare:MicrosoftCorporation(MicrosoftVoicemand);Nuancemunications(NuanceVoiceControl);VitoTechnology(VITOVoice2Go);SpeereoSoftware(SpeereoVoiceTranslator).MyCaptionforBlackBerry.PeoplewithDisabilitiesPeoplewithdisabilitiesareanotherpartofthepopulationthatbenefitfromusingspeechrecognitionprograms.Itisespeciallyusefulforpeoplewhohavedifficultywithorareunabletousetheirhands,frommildrepetitivestressinjuriestoinvolveddisabilitiesthatrequirealternativeinputforsupportwithaccessingtheputer.Infact,peoplewhousedthekeyboardalotanddevelopedRSIbecameanurgentearlymarketforspeechrecognition.Speechrecognitionisusedindeaftelephony,suchasspinvoxvoice-to-textvoicemail,relayservices,and\o"TelemunicationsRelayService"captionedtelephone.Individualswithlearningdisabilitieswhohaveproblemswiththoughttopapermunication(essentiallytheythinkofanideabutitisprocessedincorrectlycausingittoendupdifferentlyonpaper)canbenefitfromthesoftwareasithelpstooverlapthatweakness....v.語音識別開展歷史商業(yè)應(yīng)用的語音識別在美國是保健和medicaltranscriptionist等最顯著的領(lǐng)域之一。據(jù)業(yè)界專家稱,在其成立以來,語音識別〔簡稱SR〕是為了徹底消除轉(zhuǎn)錄,而不是使轉(zhuǎn)錄過程更有效率,因此是不能承受的。也有人認(rèn)為,SR在那個時候往往是在技術(shù)上缺乏。此外,為了有效地加以利用,它需要改變工作方式和記錄醫(yī)師臨床實(shí)踐,其中甚至有許多不是所有的人愿意做的事。然而,語音識別自動化轉(zhuǎn)錄,被看作是該軟件最大的限制。記敘一件事的本質(zhì)是在聽寫時非常需要解釋和判斷,這需要一個真正的人,而不是通過自動化系統(tǒng)。另一個限制是用戶和或系統(tǒng)供給商,培訓(xùn)軟件所需的時間普遍的比擬長。ASR之間的區(qū)分往往是特定領(lǐng)域的“人工語法系統(tǒng)〞和特定于語言的“自然語言處理〞。所有這些類型的應(yīng)用都有自己的特定目標(biāo)和挑戰(zhàn)。應(yīng)用衛(wèi)生保健在衛(wèi)生保健領(lǐng)域,即使語音識別技術(shù)不斷改良,medicaltranscriptionists〔MTS〕還是不會過時。許多該領(lǐng)域的專家預(yù)測,隨著語音識別技術(shù)的廣泛使用,MTS提供的效勞可能會重新分配,而不是取代。語音識別可以在醫(yī)療文件的進(jìn)程前端或后端實(shí)施。SR的前端是要求供給商裝入語音識別引擎,在他們說完話后正確顯示出來,使用者負(fù)責(zé)編輯,并簽署了有關(guān)文件。它從未審閱編輯。后端SR是提供者裝入一個數(shù)字聽寫轉(zhuǎn)換系統(tǒng)的地方,聲音通過語言識別機(jī)器尋址,并且被認(rèn)可的草案與原始的聲音文件一起給編輯者,編輯草稿并且完成報(bào)告。延期的SR是在當(dāng)前產(chǎn)業(yè)用途廣泛。當(dāng)部署與語言識別引擎一道,許多電子病歷(EMR)應(yīng)用是更加有效的,并且也更加容易地執(zhí)行。查尋、詢問和填表全部由聲音執(zhí)行比通過使用鍵盤更快速。軍事高性能戰(zhàn)斗機(jī)在過去十年中我們都花了大量的努力來測試和評價語音識別的戰(zhàn)斗機(jī)。特別值得注意的是,美國方案為先進(jìn)戰(zhàn)斗機(jī)〔AFTI〕/F-16戰(zhàn)斗機(jī)〔F-16型Vista〕安裝語音識別的集成技術(shù),法國也方案在幻影飛機(jī)安裝語音識別系統(tǒng),英國方案安裝在不同的飛機(jī)平臺。這些程序中,成功在戰(zhàn)斗機(jī)中應(yīng)用語音識別的包括:設(shè)置無線電頻率,指揮一個自動系統(tǒng),設(shè)置引導(dǎo)點(diǎn)坐標(biāo)和武器釋放參數(shù),并控制飛行顯示器。通常,只非常順利地使用了有限,拘束的詞匯量,并且主要努力致力于語音識別與航空電子學(xué)系統(tǒng)的綜合化。一些重要結(jié)論的工作如下:1.語音識別具有一定的潛力,減少飛行員的工作量,但這種潛力始終沒能實(shí)現(xiàn)。2.實(shí)現(xiàn)非常高的識別率〔95%或以上〕是使語音識別系統(tǒng)有用和識別速率降低的最關(guān)鍵因素,飛行員不會使用該系統(tǒng)。3.更自然的詞匯和語法,和更短的培訓(xùn)時間將是有益的,但前提是非常高的識別率可維持不變。實(shí)驗(yàn)室研究的在軍事環(huán)境中抗噪語音識別產(chǎn)生了可喜的成果,如果擴(kuò)展到駕駛艙,應(yīng)能改良應(yīng)用語音識別的高性能飛機(jī)。在JAS-39Gripen駕駛艙內(nèi)的瑞典飛行員Englund(2004)發(fā)現(xiàn)了公認(rèn)惡化隨著G過載。也完畢適應(yīng)在所有的情況下很大地改良了結(jié)果,并且介紹呼吸的模型顯示極大改良公認(rèn)比分。相反,可以預(yù)期,沒有任何作用的蹩腳的英語發(fā)言者被發(fā)現(xiàn)。很明顯,可以預(yù)期,造成問題的自發(fā)講話的語音識別。受限的詞匯,最重要的是,一個適當(dāng)?shù)恼Z法,能因而期望極大地改良公認(rèn)準(zhǔn)確性。目前在英國皇家空軍擁有揚(yáng)聲器依賴系統(tǒng)歐洲臺風(fēng),它要求每個試點(diǎn)創(chuàng)立一個模板。該系統(tǒng)不作任何平安關(guān)鍵或核武器的關(guān)鍵任務(wù),如核武器釋放或降低腳架,但廣泛應(yīng)用于其他駕駛艙職能。語音指令通過視覺和聽覺反應(yīng)來確認(rèn)。該系統(tǒng)被看作是一個主要的設(shè)計(jì)功能,減少飛行員的工作量,甚至可以以自己的兩個簡單的語音命令,或任何與他的飛行員只有五個命令改變試點(diǎn)的目標(biāo)。直升機(jī)在高壓和強(qiáng)烈噪音干擾的直升機(jī)環(huán)境實(shí)現(xiàn)高識別率的問題和戰(zhàn)斗機(jī)環(huán)境一眼好。實(shí)際上,在直升機(jī)的環(huán)境,噪音問題更為嚴(yán)重,不僅是因?yàn)楦咴胍羲剑疫€因?yàn)?,直升機(jī)飛行員一般不戴減少麥克風(fēng)噪音的口罩。語音識別系統(tǒng)在過去十年中已經(jīng)進(jìn)展了大量的試驗(yàn)和評估程序并應(yīng)用在直升機(jī),特別是美國陸軍航空電子研究和開展活動〔AVRADA〕和英國的皇家航空建立〔評審〕。在法國的美洲豹直升機(jī)也安裝了語音識別系統(tǒng)。在加拿大也有許多有益的工作。結(jié)果令人鼓舞,語音應(yīng)用包括:無線電通信控制;設(shè)置的導(dǎo)航系統(tǒng);和控制的自動化目標(biāo)移交制度。如在戰(zhàn)斗機(jī)的應(yīng)用,首要的問題是直升機(jī)的噪聲影響。
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 2025版羅馬柱裝飾工程總承包合同4篇
- 二零二五版在建工程抵押擔(dān)保合同模板3篇
- 2025版?zhèn)€人汽車轉(zhuǎn)讓及二手車交易平臺合作與售后服務(wù)合同4篇
- 2025年度落水管施工工程保險與理賠合同4篇
- 二零二五年度健康醫(yī)療大數(shù)據(jù)安全保障合作協(xié)議4篇
- 二零二五版股權(quán)回購項(xiàng)目擔(dān)保及投資決策合同3篇
- 2025年食用菌種植基地與銷售渠道聯(lián)盟合同2篇
- 二零二五年度廣告公司廣告活動策劃合同3篇
- 2025年高速公路車輛運(yùn)輸通行費(fèi)結(jié)算協(xié)議范本4篇
- 2024版消防系統(tǒng)維保合同范本
- 勞務(wù)協(xié)議范本模板
- 人教版(2024)數(shù)學(xué)七年級上冊期末測試卷(含答案)
- 2024年國家保密培訓(xùn)
- 2024年公務(wù)員職務(wù)任命書3篇
- CFM56-3發(fā)動機(jī)構(gòu)造課件
- 會議讀書交流分享匯報(bào)課件-《殺死一只知更鳥》
- 2025屆撫州市高一上數(shù)學(xué)期末綜合測試試題含解析
- 公司印章管理登記使用臺賬表
- 磚廠承包合同簽訂轉(zhuǎn)讓合同
- 思政課國內(nèi)外研究現(xiàn)狀分析
- 2023年公務(wù)員多省聯(lián)考《申論》題(廣西B卷)
評論
0/150
提交評論