-
公开(公告)号:US20240385814A1
公开(公告)日:2024-11-21
申请号:US18241244
申请日:2023-09-01
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
Inventor: SHENGYU FU , JIN WOO JANG , NEELAKANTAN SUNDARESAN , ALEXEY SVYATKOVSKIY
Abstract: A hybrid inference system for a coding assistant utilizes a routing model to predict whether output generated by a large language model for a given prompt would be accepted by a user of the coding assistant. The routing model routes the prompt when the routing model indicates that the output generated by the large language model is likely to be accepted. The routing model routes the prompt to a local model when the output generated by the large language model is not likely to be accepted. The routing model is trained on the historical output generated by the large language model for various prompts and the acceptance or rejection of the output by users of the coding assistant.
-
公开(公告)号:US20210357193A9
公开(公告)日:2021-11-18
申请号:US16866433
申请日:2020-05-04
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
Inventor: SHENGYU FU , XIAOYU LIU , NEELAKANTAN SUNDARESAN
Abstract: A code completion tool uses machine learning models to more precisely predict the likelihood of a method invocation completing a code fragment that follows one or more method invocations of different classes in a same document during program development. In one aspect, the machine learning model is a n-order Markov chain model that is trained on features that represent characteristics of the context of method invocations found in commonly-used programs from a sampled population. The machine learning model is implemented as a hash table contained a ranked order of hash values in descending order of probability of completing a partially-formed method invocation.
-
公开(公告)号:US20190303109A1
公开(公告)日:2019-10-03
申请号:US16360008
申请日:2019-03-21
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
Inventor: SHENGYU FU , NEELAKANTAN SUNDARESAN , YING ZHAO
Abstract: A code completion tool uses machine learning models to more precisely predict the likelihood of an invocation of a particular overloaded method completing a code fragment that follows one or more method invocations of a same class in a same document during program development. In one aspect, the machine learning model is a n-order Markov chain model that is trained on features that represent the method signatures of overloaded methods in order to generate ordered sequences of method signatures of overloaded method invocations.
-
4.
公开(公告)号:US20250068419A1
公开(公告)日:2025-02-27
申请号:US18947617
申请日:2024-11-14
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
Inventor: SHENGYU FU , XIAOYU LIU , NEELAKANTAN SUNDARESAN , ALEXEY SVYATKOVSKIY
Abstract: A retrieval-augmented neural transformer model with chunk cross-attention predicts a code review given a proposed source code change, represented as a code diff hunk, and a set of historical code review comments. The code diff hunk represents proposed edits to a source code snippet with its surrounding context that has not been changed. The historical code review comments are associated with code edits that are semantically similar to the proposed source code changes. The code diff hunk is partitioned into chunks which are used to find semantically similar historical code review comments. The set of historical code review comments is aggregated and used to guide the model in makings its predictions.
-
公开(公告)号:US20240419917A1
公开(公告)日:2024-12-19
申请号:US18209935
申请日:2023-06-14
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
Inventor: COLIN BRUCE CLEMENT , SHENGYU FU , SPANDAN GARG , NEELAKANTAN SUNDARESAN , DONGJIANG YOU , ROSHANAK ZILOUCHIAN MOGHADDAM
Abstract: A customized prompt generation service automates prompts to a large language model to perform a specified software engineering task. The service stores the custom data of a client that includes code diff hunks, source code segments, code reviews, repaired code, and unit tests from a code base or repository of the client. Prompt templates are associated with each software engineering task that include the requisite information needed for the large language model to perform the target task. A prompt to a large language model includes examples of the software engineering task from the custom data of the client of the service.
-
公开(公告)号:US20230305824A1
公开(公告)日:2023-09-28
申请号:US17703169
申请日:2022-03-24
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
Inventor: MILTIADIS ALLAMANIS , SHENGYU FU , XIAOYU LIU , NEELAKANTAN SUNDARESAN , ALEXEY SVYATKOVSKIY
Abstract: A code adaptation mechanism automatically integrates the variable names of a pasted source code snippet into variable names defined in a pre-existing partial source code program. The variable names from the pasted source code snippet are replaced with anonymized values. A deep learning model predicts the most likely variable name from the pre-existing partial source code program to replace each anonymized value. The deep learning model is trained on numerous variable usage patterns from various source code programs to learn to predict the most likely mapping of an undefined variable name from the pasted source code snippet to a variable name in the pre-existing partial source code program thereby generating a syntactically and semantically correct program.
-
公开(公告)号:US20240160435A1
公开(公告)日:2024-05-16
申请号:US17985849
申请日:2022-11-12
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
Inventor: NAN DUAN , SHENGYU FU , SHUAI LU , NEELAKANTAN SUNDARESAN , ALEXEY SVYATKOVSKIY
Abstract: A deep learning model is pre-trained with a large-scale of unsupervised data of code review tasks in order to learn the relationships between code changes and a code review. The pre-trained deep learning model predicts a code review given a code diff hunk in a code diff format. The code diff hunk includes the changed code and its surrounding context. The pre-trained deep learning model may then be fine-tuned with supervised data in order to make predictions for several code review activities, such as, code change quality estimation and code refinement.
-
公开(公告)号:US20220326918A1
公开(公告)日:2022-10-13
申请号:US17851031
申请日:2022-06-28
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
Inventor: CHRISTIAN ALMA BIRD , SHENGYU FU , NEELAKANTAN SUNDARESAN , NINA WANG , SHUO ZHANG
IPC: G06F8/36 , G06F16/22 , G06F16/2458 , G06F8/30
Abstract: A data mining technique is used to find large frequently-occurring source code patterns from methods/APIs that can be used in code development. Simplified trees that represent the syntactic structure and type and method usage of a source code fragment, such as a method, are mined to find closed and maximal frequent subtrees which represent the largest frequently-occurring source code patterns or idioms associated with a particular type and method usage. These idioms are then used in an idiom web service and/or a code completion system to assist users in the development of source code programs.
-
公开(公告)号:US20210303279A1
公开(公告)日:2021-09-30
申请号:US16835306
申请日:2020-03-31
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
Inventor: CHRISTIAN ALMA BIRD , SHENGYU FU , NEELAKANTAN SUNDARESAN , NINA WANG , SHUO ZHANG
IPC: G06F8/36 , G06F16/2458 , G06F16/22 , G06F8/30
Abstract: A data mining technique is used to find large frequently-occurring source code patterns from methods/APIs that can be used in code development. Simplified trees that represent the syntactic structure and type and method usage of a source code fragment, such as a method, are mined to find closed and maximal frequent subtrees which represent the largest frequently-occurring source code patterns or idioms associated with a particular type and method usage. These idioms are then used in an idiom web service and/or a code completion system to assist users in the development of source code programs.
-
公开(公告)号:US20200249918A1
公开(公告)日:2020-08-06
申请号:US16377789
申请日:2019-04-08
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
Inventor: ALEXEY SVYATKOVSKIY , SHENGYU FU , NEELAKANTAN SUNDARESAN , YING ZHAO
Abstract: A code completion tool uses a deep learning model to predict the likelihood of a method completing a method invocation. In one aspect, the deep learning model is a LSTM trained on features that represent the syntactic context of a method invocation derived from an abstract tree representation of the code fragment.
-
-
-
-
-
-
-
-
-