版權說明:本文檔由用戶提供并上傳,收益歸屬內容提供方,若內容存在侵權,請進行舉報或認領
文檔簡介
ParallelProgrammingInstructor:ZhangWeizhe(張偉哲)ComputerNetworkandInformationSecurityTechniqueResearchCenter,SchoolofComputerScienceandTechnology,HarbinInstituteofTechnologyCELL處理器及其編程模型3提綱
ArchitectureoftheCell單元架構
Cellprogrammingmodel單元編程模型
TheIBMFullSystemSimulatorIBM全系統(tǒng)模擬器Challenges:ThreeMajorLimiterstoProcessorPerformanceFrequencyWall頻率墻Diminishingreturnsfromdeeperpipelines從更深的管道減少返回MemoryWall記憶墻Processorfrequencyvs.DRAMmemorylatency處理器頻率與DRAM內存延遲Latencyintroducedbymultiplelevelsofmemory延遲由多層次的內存引入PowerWall電源墻LimitsinCMOStechnologyCMOS技術的局限性Hardlimittoacceptablesystempower硬限制到可接受的系統(tǒng)功率CellBE(CellBroadbandEngine)SolutionsIncreaseconcurrency增加并發(fā)性Multiplecores多核心SIMD/Vectoroperationsinacore在核心SIMD/向量運算Startmemorymovementearlysothatmemoryisavailablewhenneeded提前啟動記憶運動,以便在需要時記憶可用Increaseefficiency提高效率Simplercoresdevotemoreresourcestoactualcomputation更簡單的核心將更多的資源用于實際的計算Programmermanagedmemoryismoreefficientthandraggingdatathroughcaches程序員管理的內存比通過緩存拖動數據更有效率Largeregisterfilesgivethecompilermoreflexibilityandeliminatetransistorsneededforregisterrenaming大寄存器文件給編譯器更多的靈活性和消除需要的寄存器重命名晶體管Specializeprocessorcoresforspecifictasks為特定任務專門處理核心TheCellBEConceptCompatibilitywith64bPowerArchitecture?與64bPowerArchitecture?的兼容性BuildsonandleveragesIBMinvestmentandcommunity建立并利用IBM投資和社區(qū)Increasedefficiencyandperformance提高效率和性能Attacksonthe“PowerWall”攻擊“電墻”NonHomogenousCoherentMultiprocessor非同質相干多處理器Highdesignfrequency@alowoperatingvoltagewithadvancedpowermanagement高設計頻率@低工作電壓,并具有先進的電源管理功能Attacksonthe“MemoryWall”攻擊“記憶墻”StreamingDMAarchitecture流式DMA架構3-levelMemoryModel:MainStorage,LocalStorage,RegisterFiles3級內存模式:主存儲,本地存儲,注冊文件Attacksonthe“FrequencyWall”攻擊“頻率墻”Highlyoptimizedimplementation高度優(yōu)化的實現Largesharedregisterfilesandsoftwarecontrolledbranchingtoallowdeeperpipelines大型共享寄存器文件和軟件控制分支,以允許更深的管道CellBEBlockDiagramSPU:Synergisticprocessorunit協同處理器單元PPU:PowerPCprocessorunitMIC:memoryinterfacecontrollerBIC:businterfacecontrollerVMX:vector/SIMDmultimedia多媒體extensionPPEBlockDiagram支持兩個線程:基于Cache訪問MemorySPEBlockDiagram支持一個SIMD線程:直接可訪問Localstore(256KB),localstore通過DMA與memory交換數據MFCDetailMemoryFlowControlSystemDMAUnitLS<->LS,LS<->SysMemory,LS<->I/OTransfers8PPE-sideCommandQueueentries16SPU-sideCommandQueueentriesMMUsimilartoPowerPCMMU8SLBs,256TLBs4K,64K,1M,16MpagesizesSoftware/HWpagetablewalkPT/SLBmissesinterruptPPEAtomicCacheFacility4cachelinesforatomicupdates2cachelinesforcastout/MMUreloadUpto16outstandingDMArequestsinBIUResource/BandwidthManagementTablesTokenBasedBusAccessManagementTLBLockingIsolationModeSupport(SecurityFeature)Hardwareenforced“isolation”SPUandLocalStorenotvisible(busorjtag)SmallLS“untrustedarea”forcommunicationareaSecureBootChipSpecificKeyDecrypt/AuthenticateBootcode“SecureVault”–RuntimeIsolationSupportIsolateLoadFeatureIsolateExitFeatureElementInterconnectBus-DataTopologyFour16Bdataringsconnecting12buselementsTwoclockwise/Twocounter-clockwise兩個順時針/兩個逆時針Physicallyoverlapsallprocessorelements與所有處理器元件物理重疊Centralarbitersupportsuptothreeconcurrenttransfersperdataring中央仲裁器支持每個數據環(huán)最多三個并發(fā)傳輸Twostage,dualroundrobinarbiter兩階段雙循環(huán)仲裁者Eachelementportsimultaneouslysupports16Binand16BoutdatapathRingtopologyistransparenttoelementdatainterface16提綱
ArchitectureoftheCell
Cellprogrammingmodel
TheIBMFullSystemSimulatorCellBERuntimeEnvironmentLinuxKernelSupportCellBEThreadsandTasksModelBasicPPEandSPEProgramControlandDataFlowCell/BEBasicProgrammingConceptsCellBERuntimeEnvironmentLinux內核支持單元BE線程和任務模型基本PPE和SPE程序控制和數據流Cell/BE基本編程概念LinuxKernelSupportPPErunsPowerPCapplicationsandoperatingsystemsPPEhandlesthreadallocationandresourcemanagementamongSPEsPPE’sLinuxkernelcontrolstheSPUs’executionofprogramsScheduleSPEexecutionindependentfromregularLinuxthreadsResponsibleforruntimeloading,passingparameterstoSPEprograms,notificationofSPEeventsanderrors,anddebuggersupportPPE’sLinuxkernelmanagesvirtualmemory,includingmappingeachSPE’slocalstore(LS)andproblemstate(PS)intotheeffective-addressspaceThekernelalsocontrolsvirtual-memorymappingofMFCresources,aswellasMFCsegment-faultandfaulthandlingLargepages(16-MBpages,usingthehugetlbfsLinuxextension)aresupportedLinuxKernelSupportPPE運行PowerPC應用程序和操作系統(tǒng)PPE處理SPE之間的線程分配和資源管理PPE的Linux內核控制SPU執(zhí)行程序安排SPE執(zhí)行獨立于常規(guī)Linux線程負責運行時加載,將參數傳遞給SPE程序,SPE事件和錯誤的通知以及調試器支持PPE的Linux內核管理虛擬內存,包括將每個SPE的本地存儲(LS)和問題狀態(tài)(PS)映射到有效地址空間內核還控制MFC資源的虛擬內存映射以及MFC段故障和頁錯誤處理支持大頁面(16MB頁面,使用hugetlbfsLinux擴展)CellBEThreadsandTasksModelPPEprogram–LinuxtasksaLinuxtaskcaninitiateoneormore“SPEthreads”SPEprogram–“l(fā)ocal”SPEexecutables(“SPEthreads”)SPEexecutablesarepackagedinsidePPEexecutablefilesAnSPEthread:isinitiatedbyataskrunningonthePPEisassociatedwiththeinitiatingtaskonthePPErunsasynchronouslyfrominitiatingtaskhasauniqueidentifierknowntoboththeSPEthreadandtheinitiatingtaskcompletesatreturnfrommainintheSPEcodeCellBEThreadsandTasksModelPPE程序-Linux任務Linux任務可以啟動一個或多個“SPE線程”SPE程序-“本地”SPE可執(zhí)行文件(“SPE線程”)SPE可執(zhí)行文件打包在PPE可執(zhí)行文件中SPE線程:由在PPE上運行的任務啟動與PPE上的啟動任務相關聯從啟動任務異步運行具有SPE線程和啟動任務都已知的唯一標識符在SPE代碼返回主站完成AnSPEgroupacollectionofSPEthreadsthatshareschedulingattributesthereisadefaultgroupwithdefaultattributeseachSPEthreadbelongstoexactlyoneSPEgroupBasicPPEandSPEProgramControlandDataFlow(PPEProgram)LoadstheSPEprogramtotheLS.(PPEProgram)InstructstheSPEstoexecutetheSPEprogram.(SPEProgram)TransfersrequireddatafromthemainmemorytotheLS.(SPEProgram)Processesthereceiveddatainaccordancewiththerequirements.(SPEProgram)TransferstheprocessedresultfromtheLStothemainmemory.(SPEProgram)NotifiesthePPEprogramoftheterminationofprocessing.一個SPE組共享調度屬性的SPE線程的集合有一個具有默認屬性的默認組每個SPE線程僅屬于一個SPE組基本PPE和SPE程序控制和數據流(PPE程序)將SPE程序加載到LS。(PPE程序)指示SPE執(zhí)行SPE程序。(SPE程序)將所需的數據從主存儲器傳輸到LS。(SPE程序)根據要求處理接收到的數據。(SPE程序)將處理后的結果從LS傳送到主存儲器。(SPE程序)通知PPE程序終止處理。Cell/BEBasicProgrammingConceptsSPEsandPPEareheterogeneousmulticoresystemdifferentarchitecturesdisjointaddressspacesdifferentmodelsofmemoryandresourceprotectionrequiretwodistincttoolchainsforsoftwaredevelopment.ThePPEisjustaPowerPCrunningLinux.Nospecialprogrammingtechniquesorcompilersareneeded.ThePPEmanagesSPEprocessesasPOSIXpthreads.IBM-providedlibrary(libspe2)handlesSPEprocessmanagementwithinthethreads.LIBSPE:SPEruntimemanagementlibraryCompilertoolsembedSPEexecutablesintoPPEexecutables:onefileprovidesinstructionsforallexecutionunits.Cell/BEBasicProgrammingConceptsSPE和PPE是異構多核系統(tǒng)不同的架構不相交的地址空間不同型號的內存和資源保護需要兩個不同的工具鏈來進行軟件開發(fā)。PPE只是運行Linux的PowerPC。不需要特殊的編程技術或編譯器。PPE將SPE進程管理為POSIXpthread。IBM提供的庫(libspe2)處理線程內的SPE進程管理。LIBSPE:SPE運行時管理庫編譯器工具將SPE可執(zhí)行文件嵌入到PPE可執(zhí)行文件中:一個文件為所有執(zhí)行單元提供指令。SPEruntimemanagementlibrary:LIBSPETheSPEruntimemanagementlibrary(libspe)containsanSPEthreadprogrammingmodelforCellBEapplicationsisusedtocontrolSPEprogramexecutionfromthePPEprogramHandlesSPEsasvirtualobjectscalledSPEcontextsSPEprogramscanbeloadedandexecutedbyoperatingSPEcontextsSPEruntimemanagementlibrary:LIBSPESPE運行時管理庫(libspe)包含用于CellBE應用程序的SPE線程編程模型用于從PPE程序中控制SPE程序執(zhí)行處理SPE作為稱為SPE上下文的虛擬對象SPE程序可以通過操作SPE上下文來加載和執(zhí)行CellProgrammingWorkshop12/31/2024ClassObjectivesYouwilllearnhowtowrite,buildandrun“HelloWorld!”ontheCellSystemSimulatorNavigatethroughthebasicbuildprocessandmakefilesFamiliarizewithgccandxlccompilersFamiliarizewiththesystemsimulatorTherearethreedifferentversionsof“HelloWorld!”usedinthissessionPPEonly,SPEonly,andCellBE,i.e.usingbothPPEandSPESynchronousAsynchronousCellProgrammingWorkshop12/31/2024ClassObjectives您將學習如何在CellSystemSimulator上編寫,構建和運行“HelloWorld!”瀏覽基本構建過程并制作文件熟悉gcc和xlc編譯器熟悉系統(tǒng)模擬器在這個會話中使用了三個不同版本的“HelloWorld!”僅限PPE,只有SPE,和CellBE,即使用PPE和SPE兩者同步異步CellProgrammingWorkshop12/31/2024Howtobuild,compileandexecutethe“HelloWorld!”programPre-requisitesToolchainCompilerBuildProcessSourceCodeMakefilesSourcePPESourceSPESimulatorGettingthebinaryintothesimulatorRunningthebinaryCellProgrammingWorkshop12/31/2024Howtobuild,compileandexecutethe“HelloWorld!”program先決條件工具鏈編譯器構建過程源代碼Makefile文件來源PPE來源SPE模擬器將二進制進入模擬器運行二進制文件CellProgrammingWorkshop12/31/2024CellBE
ExecutableBuildProcessSPECodeSPE
ToolchainSPE
ObjectsEmbed
UtilityPPEObjPPECodePPE
ToolchainPPE
ObjectsPPE
Toolchain
(Linker)SPECodeSPE
CodePPECodeCellProgrammingWorkshop12/31/2024CompilingwithintheSDKTopofbuildenvironmentis/opt/cell/sdk/Includesthebuildenvironmentfiles包括構建環(huán)境文件README_build_env.txtProvidesdetailsonthebuildenvironmentfeatures,includingfiles,structureandvariables.提供有關構建環(huán)境功能的詳細信息,包括文件,結構和變量。make.footerSpecifiesallofthebuildrulesneededtoproperlybuildCBEAbinaries指定正確構建CBEA二進制文件所需的所有構建規(guī)則MustbeincludedinallSDKMakefiles(referencedrelativelyif$CELL_TOPisnotdefined)必須包含在所有SDKMakefile中(如果沒有定義$CELL_TOP,則引用)Includesmake.headermake.headerSpecifiesdefinitionsneededtoprocesstheMakefiles指定處理Makefile所需的定義Includesmake.envmake.envSpecifiesthedefaultcompilersandtoolstobeusedbymake指定make使用的默認編譯器和工具make.footerandmake.headershouldnotbemodifiedCellProgrammingWorkshop12/31/2024CommonMakefilevariablesDIRS listofsubdirectoriestobuildfirstPROGRAM_ppu PROGRAMS_ppu32-bitPPUprogram(orlistofprograms)tobuild.PROGRAM_ppu64 PROGRAMS_ppu6464-bitPPUprogram(orlistofprograms)tobuild.PROGRAM_spu PROGRAMS_spuSPUprogram(orlistofprograms)tobuild.Ifwrittenasastandalonebinary,canrunwithoutbeingembeddedinaPPUprogram.LIBRARY_embed LIBRARY_embed64CreatesalinkedlibraryfromanSPUprogramtobeembeddedintoa32-bitor64-bitPPUprogram.OBJS OBJS_<program> Listofobjectsfortheprograms(oronespecificprogram).Bydefault,allobjectsinthecurrentdirectoryarelinkedintothebinary.IMPORTS IMPORTS_<program>Listoflibrariestolinkintheprograms(oronespecificprogram).AlsousedbythePPUprogramstoembedtheSPUlinkedlibrary.CellProgrammingWorkshop12/31/2024DirectoryLayoutandExamplesofMakefilesamplesample.hMakefilesample/spuMakefilesample_spu.csample/ppuMakefilesample.cDIRS=spuppuinclude$(CELL_TOP)/buildutils/make.footerPROGRAM_spu=sample_spuLIBRARY_embed=lib_sample_spu.ainclude$(CELL_TOP)/buildutils/make.footerPROGRAM_ppu=sampleIMPORTS=../spu/lib_sample_spu.ainclude$(CELL_TOP)/buildutils/make.footerCellProgrammingWorkshop12/31/2024BuildingTheCodeEnvironmentsetupSettheCELL_TOPenvironmentvariablesothatthemakefilesystemcanbefound:exportCELL_TOP=/opt/cell/sdk/make.footercontainsthebuildrulesforthemakefilesystemEnsurecompilersorcross-compilersareintheexecutablesearchpathSeparateSPEcodeandPPEcodeintodifferentdirectoriesEachsetofcodehasit’sownmakefileandtoolchaintouseSuggestion:createasubdirectorycalled‘spu’inthedirectorywherethePPUcodeisfoundMakefiletemplateforPPEcode:DIRS=spuPROGRAM_ppu=<PPU_executable_name>IMPORTS=<spu_executable-embed.a>-lspe2include$(CELL_TOP)/buildutils/make.footerMakefiletemplateforSPEcode:PROGRAM_spu=<SPU_executable_name>LIBRARY_embed=<spu_executable-embed.a>include$(CELL_TOP)/buildutils/make.footerCellProgrammingWorkshop12/31/2024FourDifferentVersionsof“HelloWorld!”PPEonlySPEonlySynergisticPPEandSPE:synchronousOneSPEisused.MainthreadblocksandwaitsfortheSPEcodetoruntocompletionSynergisticPPEandSPE:asynchronousEightSPEsareusedMainthreadusespthreadstogetconcurrent/asynchronousexecutionCellProgrammingWorkshop12/31/2024FourDifferentVersionsof“HelloWorld!”僅PPE僅SPE協同PPE和SPE:同步使用一個SPE。主線程阻塞并等待SPE代碼運行完成協同PPE和SPE:異步使用八個SPE主線程使用pthreads來執(zhí)行并發(fā)/異步CellProgrammingWorkshop12/31/2024“HelloWorld!”–PPEOnlyPPUprogramjustlikeany“HelloWorld!”programonewouldwriteMakefilemake.footerincludedtosetupcompilerandcompilerflagsPROGRAM_pputellsmaketousePPCcross-compiler#include<stdio.h>intmain(void){printf("Helloworld!\n");return0;}PROGRAM_ppu=hello_ppuinclude$(CELL_TOP)/buildutils/make.footerPROGRAM_pputellsmaketousePPCcompilerCellProgrammingWorkshop12/31/2024“HelloWorld!”–SPEOnlySPUProgramSPUMakefile#include<stdio.h>intmain(){printf("Helloworld!\n");return0;}PROGRAM_spu:=hello_spuinclude$(CELL_TOP)/buildutils/make.footerPROGRAM_sputellsmaketouseSPEcompilerCellProgrammingWorkshop12/31/2024SynergisticPPEandSPE(SPEEmbedded)ApplicationsusesoftwareconstructscalledSPEcontextstomanageandcontrolSPEs.LinuxschedulesSPEcontextsfromallrunningapplicationsontothephysicalSPEresourcesinthesystemforexecutionaccordingtotheschedulingprioritiesandpoliciesassociatedwiththerunableSPEcontexts.libspeprovidesthemeansforcommunicationanddatatransferbetweenPPEthreadsandSPEs.CellProgrammingWorkshop12/31/2024SynergisticPPEandSPE(SPEEmbedded)應用程序使用稱為SPE上下文的軟件構造來管理和控制SPE。Linux根據與可運行的SPE上下文相關聯的調度優(yōu)先級和策略,將來自所有運行的應用程序的SPE上下文調度到系統(tǒng)中的物理SPE資源以執(zhí)行。libspe為PPE線程和SPE之間的通信和數據傳輸提供了手段。CellProgrammingWorkshop12/31/2024HowdoesaPPEprogramstartanSPEthread?4basicstepsmustbedonebythePPEprogramCreateanSPEcontext.LoadanSPEexecutableobjectintotheSPEcontextlocalstore.RuntheSPEcontext.Thistransferscontroltotheoperatingsystem,whichrequeststheactualschedulingofthecontextontoaphysicalSPEinthesystem.DestroytheSPEcontext.CellProgrammingWorkshop12/31/2024HowdoesaPPEprogramstartanSPEthread?PPE程序必須執(zhí)行4個基本步驟創(chuàng)建一個SPE上下文。將SPE可執(zhí)行對象加載到SPE上下文本地存儲中。運行SPE上下文。這將控制轉移到操作系統(tǒng),操作系統(tǒng)請求上下文的實際調度到系統(tǒng)中的物理SPE。銷毀SPE上下文。StructureofanSPEContextspe_context_ptr_tspeid;//speidisapointerpointstothespe_context_ptr_tdatastructurewhichservesastheidentifierforaspecificSPEcontextthatshouldberun.unsignedintflags=0;//flagsisabitmaskthatcanbeusedtorequestcertainspecificbehaviorfortheexecutionoftheSPEcontext.Ifthevalueis0,thisindicatesdefaultbehavior.unsignedintentry=SPE_DEFAULT_ENTRY;//entryisapointertothevariablecontainingtheinitialvalueoftheSPUinstructionpointer,theaddressatwhichtheSPEprogramshouldstartexecuting.IfthevalueofentryisSPE_DEFAULT_ENTRY,theentrypointfortheSPUmainprogramisobtainedfromtheloadedSPEimage.Thisisusuallythelocalstoreaddressoftheinitializationfunction.void*argp=NULL;//argpisapointertoapplicationspecificdata,andispassedasthesecondparametertotheSPEprogram’smain()functionvoid*envp=NULL;//envpisapointertoenvironmentspecificdata,andispassedasthethirdparametertotheSPEprogram’smain()functionspe_stop_info_tstop_info;//stop_infoisapointerpointstothespe_stop_info_tdatastructurewhichprovidesdetailedinformationwhytheSPEstoppeditsprogramexecution.StructureofanSPEContextspe_context_ptr_tspeid//speid是指向spe_context_ptr_t數據結構的指針,它用作應該運行的特定SPE上下文的標識符。unsignedintflags=0;//flags是一個位掩碼,可用于請求某些特定行為來執(zhí)行SPE上下文。如果值為0,則表示默認行為。unsignedintentry=SPE_DEFAULT_ENTRY;//entry是指向包含SPU指令指針的初始值的變量的指針,SPE指令指針是SPE程序應該開始執(zhí)行的地址。如果條目值為SPE_DEFAULT_ENTRY,則從加載的SPE圖像獲取SPU主程序的入口點。這通常是初始化函數的本地存儲地址。void*argp=NULL;//argp是指向應用程序特定數據的指針,并作為第二個參數傳遞給SPE程序的main()函數void*envp=NULL;//envp是指向環(huán)境特定數據的指針,并作為第三個參數傳遞給SPE程序的main()函數spe_stop_info_tstop_info;//stop_info是指向spe_stop_info_t數據結構的指針,它提供了SPE停止其程序執(zhí)行的詳細信息。CellProgrammingWorkshop12/31/2024SPEcontextcreationspe_context_create-CreateandinitializeanewSPEcontextdatastructure.
#include<libspe2.h> spe_context_ptr_tspe_context_create(unsignedintflags,spe_gang_context_ptr_tgang)flags-Abit-wiseORofmodifiersthatareappliedwhentheSPEcontextiscreated.gang-AssociatethenewSPEcontextwiththisgangcontext.IfNULLisspecified,thenewSPEcontextisnotassociatedwithanygang.Onsuccess,apointertothenewlycreatedSPEcontextisreturned.CellProgrammingWorkshop12/31/2024spe_program_loadspe_program_load-LoadanSPEmainprogram.
#include<libspe2.h> intspe_program_load(spe_context_ptr_tspe,spe_program_handle_t*program)spe-AvalidpointertotheSPEcontextforwhichanSPEprogramshouldbeloaded.program-AvalidaddressofamappedSPEprogram.CellProgrammingWorkshop12/31/2024spe_context_runspe_context_run-RequestexecutionofanSPEcontext.
#include<libspe2.h> intspe_context_run(spe_context_ptr_tspe,unsignedint*entry,unsignedintrunflags,void*argp,void*envp,spe_stop_info_t*stopinfo)spe-ApointertotheSPEcontextthatshouldberun.entry-Input:Theentrypoint,thatis,theinitialvalueoftheSPUinstructionpointer,atwhichtheSPEprogramshouldstartexecuting.IfthevalueofentryisSPE_DEFAULT_ENTRY,theentrypointfortheSPUmainprogramisobtainedfromtheloadedSPEimage.Thisisusuallythelocalstoreaddressoftheinitializationfunctioncrt0.runflags-AbitmaskthatcanbeusedtorequestcertainspecificbehaviorfortheexecutionoftheSPEcontext.0indicatesdefaultbehavior.argp-An(optional)pointertoapplicationspecificdata,andispassedasthesecondparametertotheSPEprogram,envp-An(optional)pointertoenvironmentspecificdata,andispassedasthethirdparametertotheSPEprogram,stopinfoAn(optional)pointertoastructureoftypespe_stop_info_tCellProgrammingWorkshop12/31/2024spe_context_destroyspe_context_destroy-DestroythespecifiedSPEcontext.
#include<libspe2.h>
intspe_context_destroy(spe_context_ptr_tspe)spe-SpecifiestheSPEcontexttobedestroyedOnsuccess,0(zero)isreturned,else-1isreturnedLIBSPEVersion2.0–SinglethreadAsimpleapplicationusesasinglePPEthread,thatis,theapplication’sPPEthreadThebasicschemeforasimpleapplicationusinganSPEisasfollows:CreateanSPEcontextLoadanSPEexecutableobjectintotheSPEcontext’slocalstoreRunSPEcontext–thistransferscontroltotheoperatingsystemrequestingtheactualschedulingofthecontexttoaphysicalSPEinthesystemDestroySPEcontextNotethatstep3.aboverepresentsasynchronouscalltotheoperatingsystem.Thecallingapplication’sPPEthreadblocksuntiltheSPEstopsexecutionandtheoperatingsystemreturnsfromthesystemcallinvokingtheSPEexecutionLIBSPEVersion2.0–Singlethread一個簡單的應用程序使用單個PPE線程,即應用程序的PPE線程使用SPE的簡單應用程序的基本方案如下:創(chuàng)建一個SPE上下文將SPE可執(zhí)行對象加載到SPE上下文的本地存儲中運行SPE上下文-這將控制轉移到操作系統(tǒng),請求實際調度上下文到系統(tǒng)中的物理SPE破壞SPE上下文請注意,上面的步驟3.代表對操作系統(tǒng)的同步調用。調用應用程序的PPE線程阻塞,直到SPE停止執(zhí)行,并且操作系統(tǒng)從調用SPE執(zhí)行的系統(tǒng)調用返回HelloWorldsinglethreadsample:SPEprogram
hello_spu#include<stdio.h>intmain(unsignedlonglongspeid,unsignedlonglongargp,unsignedlonglongenvp){printf("HelloWorld!\n");return0;}HelloWorldsinglethreadsample:SPEprogram
hello_ppu#include<errno.h>#include<stdio.h>#include<stdlib.h>#include<libspe2.h>externspe_program_handle_thello_spu;intmain(void){//StructureforanSPEcontextspe_context_ptr_tspeid;unsignedintflags=0;unsignedintentry=SPE_DEFAULT_ENTRY;void*argp=NULL;void*envp=NULL;spe_stop_info_tstop_info;intrc;//CreateanSPEcontextspeid=spe_context_create(flags,NULL);if(speid==NULL){perror("spe_context_create");return-2;}//LoadanSPEexecutableobjectintotheSPEcontextlocalstoreif(spe_program_load(speid,&hello_spu)){perror("spe_program_load");return-3;}//RuntheSPEcontextrc=spe_context_run(speid,&entry,0,argp,envp,&stop_info);if(rc<0)perror("spe_context_run");//DestroytheSPEcontextspe_context_destroy(speid);return0;}NameofanSPEprogramlibspe2headerfileLIBSPEVersion2.0–Multi-threadManyapplicationsneedtousemultipleSPEsconcurrentlyInthiscase,itisnecessaryfortheapplicationtocreateatleastasmanyPPEthreadsasconcurrentSPEcontextsarerequiredEachofthesePPEthreadsmayrunasingleSPEcontextatatimeIfNconcurrentSPEcontextsareneeded,itiscommontohaveamainapplicationthreadplusNPPEthreadsdedicatedtoSPEcontextexecutionThebasicschemeforasimpleapplicationrunningNSPEcontextsisCreateNSPEcontextsLoadtheappropriateSPEexecutableobjectintoeachSPEcontext’slocalstoreCreateNPPEthreadsIneachofthesePPEthreadsrunoneoftheSPEcontextsTerminatethePPEthreadWaitforallNPPEthreadstoterminateDestroyallNSPEcontextsLIBSPEVersion2.0–Multi-thread許多應用程序需要同時使用多個SPE在這種情況下,應用程序必須至少創(chuàng)建與并行SPE上下文相同數量的PPE線程這些PPE線程中的每一個可以一次運行單個SPE上下文如果需要N個并發(fā)SPE上下文,通常有一個主應用程序線程加上專用于SPE上下文執(zhí)行的NPPE線程運行NSPE上下文的簡單應用程序的基本方案是創(chuàng)建N個SPE上下文將相應的SPE可執(zhí)行對象加載到每個SPE上下文的本地存儲中創(chuàng)建N個PPE線程在每個這些PPE線程中運行一個SPE上下文終止PPE線程等待所有N個PPE線程終止銷毀所有NSPE上下文Multi-thread:SingleThreadAsynchronousExecutionExampleTheapplicationcreatesonePPEthreadThePPEthreadwillrunanSPEcontextatatimeThebasicschemeforasimpleapplicationrunning1SPEcontextsasynchronouslyisCreateanSPEcontextLoadtheappropriateSPEexecutableobjectintotheSPEcontext’slocalstoreCreateaPPEthreadRuntheSPEcontextinthePPEthreadTerminatethePPEthreadWaitforthePPEthreadtoterminateDestroytheSPEcontextMulti-thread:SingleThreadAsynchronousExecutionExample應用程序創(chuàng)建一個PPE線程PPE線程將一次運行SPE上下文運行1個SPE上下文的簡單應用程序的基本方案是異步運行創(chuàng)建一個SPE上下文將相應的SPE可執(zhí)行對象加載到SPE上下文的本地存儲中創(chuàng)建一個PPE線程在PPE線程中運行SPE上下文終止PPE線程等待PPE線程終止銷毀SPE上下文#include<errno.h>#include<stdio.h>#include<stdlib.h>#include<libspe2.h>#include<pthread.h>//StructureforanSPEthreadtypedefstructppu_pthread_data{spe_context_ptr_tcontext;pthread_tpthread;unsignedintentry;unsignedintflags;void*argp;void*envp;spe_stop_info_tstopinfo;}ppu_pthread_data_t;//pthreadfunctiontoruntheSPEcontextvoid*ppu_pthread_function(void*arg){ppu_pthread_data_t*datap=(ppu_pthread_data_t*)arg;intrc;rc=spe_context_run(datap->context,&datap->entry,datap->flags,datap->argp,datap->envp,&datap->stopinfo);pthread_exit(NULL);}externspe_program_handle_thello_spu;intmain(void){ppu_pthread_data_tdata;data.context=spe_context_create(0,NULL);spe_program_load(data.context,&hello_spu);data.entry=SPE_DEFAULT_ENTRY;data.flags=0;data.argp=NULL;data.envp=NULL;pthread_create(&data.pthread,NULL,&ppu_pthread_function,&data);pthread_join(data.pthread,NULL);spe_context_destroy(data.context);return0;}MultiThreadAsynchronousExecutionExample#include<stdlib.h>#include<stdio.h>#include<errno.h>#include<libspe2.h>#include<pthread.h>externspe_program_handle_tsimple_spu;#defineSPU_THREADS6void*ppu_pthread_function(void*arg){spe_context_ptr_tctx;unsignedintentry=SPE_DEFAULT_ENTRY;ctx=*((spe_context_ptr_t*)arg);if(spe_context_run(ctx,&entry,0,NULL,NULL,NULL)<0){perror("Failedrunningcontext");exit(1);}pthread_exit(NULL);}intmain(){inti;spe_context_ptr_tctxs[SPU_THREADS];pthread_tthreads[SPU_THREADS];for(i=0;i<SPU_THREADS;i++){//CreateseveralSPE-threadstoexecute'simple_spu'.if((ctxs[i]=
spe_context_create(0,NULL))==NULL){perror("Failedcreatingcontext");exit(1);}//Createcontextif(spe_program_load(ctxs[i],&simple_spu)){perror("Failedloadingprogram");exit(1); }//Loadprogramintocontextif(pthread_create(&threads[i],NULL,&ppu_pthread_function,&ctxs[i])){perror("Failedcreatingthread");exit(1);}//CreatethreadforeachSPEcontext}for(i=0;i<SPU_THREADS;i++){//WaitforSPU-threadtocompleteexecution.if(pthread_join(threads[i],NULL)){perror("Failedpthread_join");exit(1);}}printf("\nTheprogramhassuccessfullyexecuted.\n");return(0);}63提綱
ArchitectureoftheCell
Cellprogrammingmodel
TheIBMFullSystemSimulatorCellProgrammingWorkshop12/31/2024Intelx86SimulatorOverviewSystemSimPowerPCCaches(L1/L2)MemoryROMExecutablesUARTL3IntCtrlrPowerPCx86-64RuntimeandlibrariesPowerPCPowerPCBEBusLinux(FedoraCore5)SystemSoftware:Hypervisor,Linux/PPCorK42RealSystems:SystemSim:SimulationofhardwareSoftwareStack:RunningonSystemSimDMADisksConsoleWindowGUIWindowsApplicationSourceCodeProgrammingToolsCompilersProgrammingModelOpenMPMPIDevelopmentEnvironment:TracesCellProgrammingWorkshop12/31/2024SystemSimSystemSimRuntimeEnvironmenttcl/tk/blt/mambocmdsTCL/Tk/BLTGUIScripts41StartupTCLFile(.systemsim.tcl)#Createsimulatorinstancedefinedupcellmysim#Loadkernelbootimagemysimloadvmlinux./vmlinux0x1000000#StarttheGUIMamboInit::guiunix$PATH=/opt/IBM/systemsim-cell/bin:$PATHsystemsim-gGUIEnabledLicensedMaterials–PropertyofIBM.?CopyrightIBMCorporation2001,2006AllRightsReserved%5KernelBootImage(vmlinux)3DiskImage(sysroot_disk)NetworkServiceDaemonmysim(cellmodel)consoleROMnetmodelcpumodelmemory2diskmodelROMImage(rom.bin)CellProgrammingWorkshop12/31/2024Operating-SystemModesLinuxModeSimulatorbootsafullLinuxoperatingsystemonthesimulatedsystemApplicationsarelaunchedfromtheLinuxconsolewindowandrunThesimulatedoperatingsystemhandlesallthesystemcallsStandaloneModeTheapplicationisloadeddirectlyintothesimulatedmachinewithoutanoperatingsystemThesimulatortrapsallsystemcallsmadebytheapplicationandperformsthesefunctionsinplace
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯系上傳者。文件的所有權益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網頁內容里面會有圖紙預覽,若沒有圖紙預覽就沒有圖紙。
- 4. 未經權益所有人同意不得將文件中的內容挪作商業(yè)或盈利用途。
- 5. 人人文庫網僅提供信息存儲空間,僅對用戶上傳內容的表現方式做保護處理,對用戶上傳分享的文檔內容本身不做任何修改或編輯,并不能對任何下載內容負責。
- 6. 下載文件中如有侵權或不適當內容,請與我們聯系,我們立即糾正。
- 7. 本站不保證下載資源的準確性、安全性和完整性, 同時也不承擔用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 個人2024年度在線教育服務合同5篇
- 海底兩萬里觀后感與探討
- 16大家一起來合作 (說課稿)-部編版道德與法治一年級下冊
- 3 古詩詞三首 西江月·夜行黃沙道中(說課稿)-2024-2025學年統(tǒng)編版語文六年級上冊
- 上饒衛(wèi)生學校二期建設項目(2024版)
- 會員協議書范本
- 保溫系統(tǒng)施工的合同范本
- 專用化學品銷售協議示例(2024年發(fā)布)版B版
- 專用設備買賣協議細則(2024版)版B版
- 專業(yè)鞋品采購協議模板2024版A版
- 社區(qū)老年人項目計劃書
- 《1.我又長大了一歲》教學課件∣泰山版
- 斷裂力學-1緒論課件
- 深基坑工程驗收表
- 醫(yī)學交流課件:RCT的基本概念及原則(PPT 37頁)
- SLZ 549-2012 用水審計技術導則(試行)
- qes三體系審核培訓ppt課件
- CASS文字編緝
- 農文旅一體化項目可行性研究報告寫作范文
- JJF 1406-2013 地面激光掃描儀校準規(guī)范(原版-高清)
- 轉爐系統(tǒng)機械設備概述
評論
0/150
提交評論