Figure 2: Visual explanation of the representation and computation of context in the D-dimensional space as defined in Equation 4; the final paragraph of Section 2.2 explains the sum over the |I_t| locations. Each source code token and feature maps to a learned D-dimensional vector in continuous space. The token vectors are multiplied by the position-dependent context matrices C_i and summed, then added to the sum of all the feature vectors. The resulting vector is the D-dimensional representation of the current source code identifier. Finally, the inner product of the context and the identifier vectors is added to a scalar bias b, producing a score s_θ(·) = r_context · q_isDone + b_isDone for each identifier. This neural network is implemented by mapping its equations into code.

An advantage of log-bilinear models is that they make it especially easy to exploit long-distance information; e.g., when predicting the name of a method, it is useful to take into account all of the identifiers that appear in the method body. We model long-distance context via a set of feature functions, such as "whether any variable in the current method is named addCount", "whether the return type of the current method is int", and so on. The log-bilinear context model combines these features with the local context.
As before, suppose that we are trying to predict a code token t given a sequence of context tokens c = (c0, c1, ..., cN). We assume that c contains all of the other tokens in the file that are relevant for predicting t, e.g., tokens from the body of the method that t names. The tokens in c that are nearest to the target t are treated specially. Suppose that t occurs in position i of the file; that is, if the file is the token sequence t1, t2, ..., then t = ti. Then the local context is the set of tokens that occur within K positions of t, that is, the set {t_{i+k}} for −K ≤ k ≤ K, k ≠ 0. The local context includes tokens that occur both before and after t.
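As a concrete illustration, the local context window can be extracted as follows (a minimal sketch; the token sequence and the choice K = 2 are illustrative, not taken from the paper's experiments):

```python
def local_context(tokens, i, K):
    """Return (offset, token) pairs within K positions of tokens[i],
    excluding the target position itself (k != 0)."""
    return [(k, tokens[i + k])
            for k in range(-K, K + 1)
            if k != 0 and 0 <= i + k < len(tokens)]

tokens = ["final", "boolean", "isDone", "=", "false"]
# Local context of the target "isDone" (i = 2) with K = 2:
print(local_context(tokens, 2, K=2))
# [(-2, 'final'), (-1, 'boolean'), (1, '='), (2, 'false')]
```

Keeping the offset k alongside each token matters because, as described below, the model weights each context token by a position-dependent matrix C_k.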
The overall form of the context model will follow the generic form in (1) and (2), except that the context representation r_c is defined differently. In the context model, we define r_c using two different types of context: local and global. First, the local context is handled in a very similar way to the log-bilinear LM. Each possible lexeme v is assigned a vector r_v ∈ R^D, and, for each token t_k that occurs within K tokens of t in the file, we add its representation r_{t_k} into the context representation.

The global context is handled using a set of features. Each feature is a binary function over the context tokens c, such as the examples described at the beginning of this section. Formally, each feature f maps a value of c to either 0 or 1. Maddison and Tarlow [31] use a similar idea to represent features of a syntactic context, that is, a node in an AST. Here, we extend this idea to incorporate arbitrary features of long-distance context tokens c. The first column of Table 4 presents the full list of features that we use in this work. To learn an embedding, we assign each feature function a single vector in the continuous space, in the same way as we did for tokens. Mathematically, let F be the set of all features in the model, and let F_c, for a context c, be the set of all features f with f(c) = 1. Then for each feature f ∈ F, we learn an embedding r_f ∈ R^D, which is included as a parameter of the model in exactly the same way that r_t was for the language modeling case.
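To make the feature mechanism concrete, here is a minimal sketch of binary feature functions and the computation of F_c (the feature names, predicates, and context are hypothetical stand-ins in the spirit of the examples above, not the paper's actual feature extractors):

```python
# Each feature is a binary predicate over the full context token list c.
features = {
    "has-var-addCount": lambda c: "addCount" in c,
    "returns-int":      lambda c: "int" in c,  # crude stand-in for a return-type check
}

def active_features(c):
    """F_c: the set of feature names f with f(c) = 1."""
    return {name for name, f in features.items() if f(c)}

c = ["int", "count", "(", ")", "{", "return", "addCount", ";", "}"]
print(sorted(active_features(c)))
# ['has-var-addCount', 'returns-int']
```

Each name in this set indexes a learned embedding r_f, so only the features that fire on a given context contribute to its representation.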
Now, we can formally define a context model of code as a probability distribution P(t|c) that follows the form (1) and (2), where r_c = r_context, and r_context is

    r_context = Σ_{f ∈ F_c} r_f + Σ_{k : K ≥ |k| > 0} C_k r_{t_{i+k}},    (4)

where, as before, C_k is a position-dependent D × D diagonal matrix that is also learned during training.¹ Intuitively, this equation sums the embeddings of each token t_k that occurs near t in the file, and sums the embeddings of each feature function f that returns true (i.e., 1) for the context c. Once we have this vector r_context, just as before, we can select a token t such that the probability P(t|c) is high, which happens exactly when r_context · q_t is high; in other words, when the embedding q_t of the proposed target t is close to the embedding r_context of the context.

Figure 2 gives a visual explanation of the probabilistic model. This figure depicts how the model assigns probability to the token isDone if the preceding two tokens are final boolean and the succeeding two are = false. Reading from right to left, the figure describes how the continuous embedding of the context is computed. Following the dashed (pink) arrows, the tokens in the local context are each assigned D-dimensional vectors r_final, r_boolean, and so on, which are added together (after multiplication by the C_k matrices that model the effect of distance) to obtain the effect of the local context on the embedding r_context. The solid (blue) arrows represent the global context, pointing from the names of the feature functions that return true to the continuous embeddings of those features. Adding the feature embeddings to the local context embeddings yields the final context embedding r_context. The similarity between this vector and the embedding of the target vector q_isDone is computed using a dot product, which yields the value of s_θ(isDone, c) that is necessary for computing the probability P(isDone | c) via (1).

Multiple Target Tokens. Up to now, we have presented the model in the case where we are renaming a target token t that occurs at only one location, such as the name of a method. Other cases, such as when suggesting variable names, require taking all of the occurrences of a name into account [2]. When a token t appears at a set of locations I_t, we compute the context vectors r_context separately for each occurrence t_i, for i ∈ I_t, then average them. When we do this, we carefully rename all occurrences of t to a special token called SELF to remove t from its own context.

2.3 Subtoken Context Models of Code

A limitation of all of the previous models is that they are unable to predict neologisms, that is, unseen identifier names that have not been used in the training set. The reason for this is that we allow the map from a lexeme v to its embedding q_v to be arbitrary (i.e., without learning a functional form for the relationship), so we have no basis to assign continuous vectors to identifier names that have not been observed. In this section, we sidestep this problem by exploiting the internal structure of identifier names, resulting in a new model which we call a subtoken context model.

¹ Note that k can be positive or negative, so that in general C_{−2} ≠ C_2.
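Putting the pieces together, Equation 4 and the scoring in (1) and (2) can be sketched in NumPy as follows. This is a minimal illustration, not the paper's implementation: the dimensions, vocabulary size, and randomly initialized parameters are placeholders, and a real model would learn q, b, r, and the diagonal C_k by gradient descent.

```python
import numpy as np

rng = np.random.default_rng(0)
D, V, K = 8, 5, 2                       # embedding dim, vocab size, window size

q = rng.normal(size=(V, D))             # target embeddings q_t
b = np.zeros(V)                         # per-token biases b_t
r_tok = rng.normal(size=(V, D))         # context token embeddings r_v
C = {k: rng.normal(size=D)              # diagonal D x D matrices C_k, stored as
     for k in range(-K, K + 1) if k}    # their diagonals; note C[-2] != C[2]
r_feat = {"returns-boolean": rng.normal(size=D)}  # feature embeddings r_f

def r_context(local, active):
    """Equation 4: sum of feature embeddings plus distance-weighted
    local-token embeddings. local = [(offset k, token id), ...]."""
    r = sum((r_feat[f] for f in active), np.zeros(D))
    for k, v in local:
        r += C[k] * r_tok[v]            # diagonal C_k acts elementwise
    return r

def predict(local, active):
    """P(t | c): softmax over scores s_t = r_context . q_t + b_t."""
    s = q @ r_context(local, active) + b
    p = np.exp(s - s.max())             # shift for numerical stability
    return p / p.sum()

p = predict(local=[(-2, 0), (-1, 1), (1, 3), (2, 4)],
            active=["returns-boolean"])
print(p.argmax(), p.sum())              # most probable token id; probabilities sum to 1
```

Storing each C_k as a diagonal keeps the per-token cost at O(D) rather than O(D²), which is exactly why the paper restricts the context matrices to diagonal form.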