
summary 16--27

Posted: 2019-09-27 16:27:36


1. Paper Reading

$ \lfloor ab/p \rfloor p$

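I am picking papers from the different tracks on the ICSE 2019 schedule, planning to read one paper per topic in order to settle on a research direction.
The one I am reading now is from the Automated Program Repair track:
Learning to Spot and Refactor Inconsistent Method Names.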

  • Problem definition: spot and refactor inconsistent method names. (The method is already implemented, so both its name and its body are available.)
  • Background
  1. Paragraph Vector: used to compute similarities among method names.
  2. CNNs: used to identify similar method bodies.
  3. Word2Vec: used to embed the tokens of a method body.
  • Proposed method:
    [Figure omitted]
    Unsupervised learning.
    The method has two phases: training, and identification & suggestion.

a. Training: the tokens of the method names and method bodies in the training set are used to produce two vector spaces, a method name vector space and a method body vector space.

b. Inconsistency identification
Given a method, they take the set of adjacent vectors of its name (\(n_i\)) and of its body (\(b_i\)), denoted \(adj(n_i)\) and \(adj(b_i)\) respectively.
Adjacency is measured by cosine similarity.
When [method names of \(adj(n_i)\)] \(\bigcap\) [method names of \(adj(b_i)\)] = \(\emptyset\), the method name is inconsistent.
To capture similar behavior rather than identical names, they relax the consistency condition and compare only the first sub-token of each method name instead of the full name:
when [first sub-tokens of the method names of \(adj(n_i)\)] \(\bigcap\) [first sub-tokens of the method names of \(adj(b_i)\)] = \(\emptyset\), the method name is inconsistent.
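
The identification rule can be sketched in a few lines of Python. This is only a toy illustration under my own assumptions, not the paper's implementation: the vectors are hand-made 2-D tuples instead of learned embeddings, each space is a dict keyed by a unique method name, the neighbourhood size `k` is arbitrary, and the helper names (`cosine`, `adjacent_names`, `first_sub_token`, `is_inconsistent`) are hypothetical.

```python
import math
import re


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def adjacent_names(query, space, k):
    """Names of the k methods in `space` (name -> vector) closest to `query`."""
    ranked = sorted(space, key=lambda name: cosine(query, space[name]), reverse=True)
    return set(ranked[:k])


def first_sub_token(method_name):
    """First camelCase or snake_case sub-token, lower-cased ('getUserName' -> 'get')."""
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", method_name)
    return parts[0].lower() if parts else method_name.lower()


def is_inconsistent(name_vec, body_vec, name_space, body_space, k=2, relaxed=True):
    """True when the neighbours of the name and of the body share no method name
    (or, in the relaxed form, no first sub-token)."""
    adj_n = adjacent_names(name_vec, name_space, k)   # names close to the name
    adj_b = adjacent_names(body_vec, body_space, k)   # names of methods with close bodies
    if relaxed:
        adj_n = {first_sub_token(m) for m in adj_n}
        adj_b = {first_sub_token(m) for m in adj_b}
    return adj_n.isdisjoint(adj_b)


# Toy "training set": getters point along the x axis, setters along the y axis.
name_space = {"getName": (1.0, 0.0), "getAge": (0.9, 0.1),
              "setName": (0.0, 1.0), "setAge": (0.1, 0.9)}
body_space = {"getName": (1.0, 0.1), "getAge": (0.9, 0.0),
              "setName": (0.1, 1.0), "setAge": (0.0, 0.9)}

# A method that is named like a getter but whose body looks like a setter.
print(is_inconsistent((1.0, 0.05), (0.05, 1.0), name_space, body_space))
```

With `k=1` the toy data also shows why the relaxed condition matters: a query whose nearest name is `getName` and whose nearest body belongs to `getAge` is flagged under the full-name comparison but not under the first-sub-token comparison, since both neighbours start with `get`.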

c. Suggestion: a new name is selected from the method names of \(adj(b_i)\) according to some ranking strategies.
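
These notes do not record which ranking strategies the paper uses, so the sketch below picks one simple possibility purely for illustration: order the names of the nearest bodies by cosine similarity to the query body and drop repeated names. The function `suggest_names`, the toy vectors, and the choice of `k` are all my own assumptions.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def suggest_names(body_vec, body_space, k=3):
    """Names of the k bodies most similar to `body_vec`, best first, without repeats.

    `body_space` is a list of (method_name, body_vector) pairs, so the same
    name may appear on several methods.
    """
    scored = sorted(body_space, key=lambda item: cosine(body_vec, item[1]), reverse=True)
    suggestions = []
    for name, _ in scored[:k]:
        if name not in suggestions:   # keep only the best-ranked occurrence
            suggestions.append(name)
    return suggestions


# Toy body vector space (hypothetical values).
toy_bodies = [
    ("setName", (0.1, 1.0)),
    ("setAge", (0.3, 0.9)),
    ("getName", (1.0, 0.1)),
    ("setName", (0.2, 1.0)),
]

# Candidate names for a method whose body looks like a setter.
print(suggest_names((0.05, 1.0), toy_bodies, k=3))
```

Other orderings, for example by how often a name occurs among the neighbours, would fit the same interface; only the sort key and the de-duplication step would change.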


Original article: https://www.cnblogs.com/enshengshi/p/11598628.html
