标签:similar span pap amp embedding 插入 ado ase back
$ \lfloor ab/p \rfloor p$
我是根据ICSE2019schedule上的不同方向的论文来读的,打算一个主题读一篇来确定方向。
目前读的是 Automated Program Repair 方向的
Learning to Spot and Refactor Inconsistent Method Names。
a. Training phase takes tokens of method name and method body in the training set to produce two vector space--method name vector space and Method name space.
b. Inconsistency Identification
Given a method , they take a set of adjacent vectors for its name( \(n_i\)) and body (\(b_i\)) respectively (denoted \(adj(n_i)\) and \(adj(b_i)\))
adjacency is measured by cosine similarity.
when [method name of (\(adj(n_i)\))] \(\bigcap\) [method name of (\(adj(b_i)\)) ] = \(\emptyset\) , it is inconsistent.
To capture the similar behavior rather than the same name ,they relax the condition of consistency to take the first sub-token of each method name instead of comparing full method name,
when [first sub-token of method name of (\(adj(n_i)\))] \(\bigcap\) [ first sub-token of method name of (\(adj(b_i)\)) ] = \(\emptyset\) , it is inconsistent.
b. suggestion is to select new name in the set of method name of(adj(\(b_i\))) according to some ranking strategies.
标签:similar span pap amp embedding 插入 ado ase back
原文地址:https://www.cnblogs.com/enshengshi/p/11598628.html