-
公开(公告)号:US20230214212A1
公开(公告)日:2023-07-06
申请号:US17568597
申请日:2022-01-04
Applicant: Microsoft Technology Licensing, LLC
Inventor: David Ellis PUGH , Mark Alistair WILSON-THOMAS , Vu Minh LE
IPC: G06F8/71 , G06F40/197
CPC classification number: G06F8/71 , G06F40/197
Abstract: Distinguishing pattern differences from non-pattern differences. A set of differences is identified. The set comprises a plurality of differences between first and second versions of a document. A pattern is identified. The pattern explains a transformation from a first string in the first version of the document to a second string in the second version of the document. A subset of differences are identified. The subset comprises a plurality of differences, from among the set, which match the pattern. While presenting a user interface that visually highlights differences between the first and second versions of the document, a first visual treatment is applied to a first difference, based on the first difference being included in the subset. A second visual treatment is also applied to a second difference, based on the second difference being excluded from the subset. The second visual treatment is different than the first visual treatment.
-
公开(公告)号:US20230176829A1
公开(公告)日:2023-06-08
申请号:US17544502
申请日:2021-12-07
Applicant: Microsoft Technology Licensing, LLC
Inventor: Kiarash RAHMANI , Mohammad RAZA , Sumit GULWANI , Vu Minh LE , Daniel James MORRIS , Arjun RADHAKRISHNA , Gustavo ARAUJO SOARES , Ashish TIWARI
Abstract: Embodiments use a multi-modal approach to generate software programs that match a solution program description. The solution program description may include natural language, input-output examples, partial source code, desired operators, or other hints. Some embodiments use optimized prompts to a pre-trained language model to obtain initial candidate programs. Maximal program components are extracted and then recombined variously using component-based synthesis. Beam search reduces a solution program search space by discarding some candidates from a given synthesis iteration. Relevance metrics, string similarity metrics, operator frequency distributions, token rareness scores, and other optimizations may be employed. By virtue of optimizations and the multi-modal approach, a solution program may be obtained after fewer iterations than by use of a language model alone. The multi-modal approach is domain agnostic, as illustrated by examples using regular expression and cascading style sheet selector domain specific languages.
-
公开(公告)号:US20250077538A1
公开(公告)日:2025-03-06
申请号:US18241028
申请日:2023-08-31
Applicant: Microsoft Technology Licensing, LLC
Inventor: Shaily Jignesh FOZDAR , David Joseph DONAHUE , Fang LIU , Noelle Yanhui LI , Abhishek NARAIN , Irene Rogan SHAFFER , Wee Hyong TOK , Ehimwenma NOSAKHARE , Vivek GUPTA , Gust VERBRUGGEN , Vu Minh LE , Jordan Joseph HENKEL , Avrilia FLORATOU , Joyce Yu CAHOON , Richard ANARFI , Jason Wang , Daniel MUÑOZ HUERTA , Yan Qiu
IPC: G06F16/25 , G06F16/242
Abstract: A computer-implemented method for performing natural language-based data integration includes causing execution of a data integration application on a remote device via a network and causing surfacing of a GUI corresponding to the data integration application on a display of the remote device. The method includes receiving, via the GUI, a natural language input representing a data integration task, generating, via an LLM, a set of ordered activities corresponding to the data integration task represented by the natural language input, and selecting, via the LLM, one or more APIs for performing each activity within the set of ordered activities. The method also includes generating a data pipeline based on the set of ordered activities and the API(s) for performing each activity, as well as back-translating the data pipeline to a desired data format for execution by the data integration application.
-
公开(公告)号:US20220317979A1
公开(公告)日:2022-10-06
申请号:US17220156
申请日:2021-04-01
Applicant: Microsoft Technology Licensing, LLC
Inventor: Gustavo ARAUJO SOARES , Piyush ARORA , Titus BARIK , Peter GROENEWEGEN , Sumit GULWANI , Ameya Sanjay KETKAR , Vu Minh LE , Wode NI , David Ellis PUGH , Arjun RADHAKRISHNA , Ivan RADICEK , Ashish TIWARI , Mark Alistair WILSON-THOMAS
IPC: G06F8/33 , G06F16/901
Abstract: Edit automation functionality generalizes edits performed by a user in a document, locates similar text, and recommends or applies transforms while staying within a current workflow. Source code edits such as refactoring are automated. The functionality uses or provides anchor target lists, temporal edit patterns, edit graphs, automatable edit sequence libraries, and other data structures and computational techniques for identifying locations appropriate for particular edits, for getting transforms, for selecting optimal transforms, for leveraging transforms in an editing session or later, and for displaying transform recommendations and results. The edit automation functionality enhances automation subtool generation, discoverability, and flexibility, for refactoring, snippet insertion, quick actions in an integrated development environment, and other automatable edit sequences.
-
公开(公告)号:US20200334054A1
公开(公告)日:2020-10-22
申请号:US16592470
申请日:2019-10-03
Applicant: Microsoft Technology Licensing, LLC
Inventor: Sumit GULWANI , Arjun RADHAKRISHNA , Abhishek UDUPA , Gustavo ARAUJO SOARES , Vu Minh LE , Anders MILTNER , Mark A. WILSON-THOMAS
Abstract: Automatically identifying context-specific repeated transformations (such as repeated edit tasks) that are based on observation of the developer drafting or modifying code. As the developer modifies the code, the code passes through a series of states, one after the other. The computing system observes the series of states of the code. It is based on this observation that the computing system identifies repeated transformations of the code for potentially offering to continue performing the repeated transformations for the user. This alleviates the developer from having to manually perform the remainder of the repeated transformations.
-
公开(公告)号:US20240256423A1
公开(公告)日:2024-08-01
申请号:US18159712
申请日:2023-01-26
Applicant: Microsoft Technology Licensing, LLC
Inventor: Jialu ZHANG , José Pablo CAMBRONERO SÁNCHEZ , Gustavo ARAUJO SOARES , Vu Minh LE , Sumit GULWANI , Gust Ben Anneloes VERBRUGGEN
CPC classification number: G06F11/3608 , G06F8/42 , G06F8/71
Abstract: Some embodiments generate prompts and submit them in queries to a language model trained on code to perform automated program repair. Some embodiments fix syntactic mistakes and semantic mistakes by combining multimodal prompts, iterative querying, test-case-based selection of few-shots, and program chunking. In some cases, edit distance is minimized between an initial flawed program and the automatically created improved version of that program. The initial flawed program is obtained from a programming student, or from a source code generator.
-
公开(公告)号:US20230280989A1
公开(公告)日:2023-09-07
申请号:US17687577
申请日:2022-03-04
Applicant: Microsoft Technology Licensing, LLC
Inventor: José Pablo CAMBRONERO SÁNCHEZ , Sumit GULWANI , Vu Minh LE , Daniel PERELMAN , Arjun RADHAKRISHNA , Daniel Galen SIMMONS , Clint Michael SIMON , Ashish TIWARI
IPC: G06F8/41 , G06F40/211 , G06F40/30
CPC classification number: G06F8/436 , G06F8/427 , G06F40/211 , G06F40/30
Abstract: Techniques are described herein that are capable of synthesizing a computer program to include idiomatic function(s) and semantically-meaningful variable(s) using programming by example. For instance, an intent of a user to synthesize a computer program to include functionality configured to generate sample output(s) from respective input(s) is determined based at least in part on receipt of the sample input(s) and the respective sample output(s) from the user. Based at least in part on the determined intent, the computer program is synthesized to include the idiomatic function(s) by configuring the idiomatic function(s) to have the target functionality and to conform to a convention of the target domain-specific language associated with a textual representation of the computer program to be displayed to the user. Non-semantically-meaningful variable(s) included among the idiomatic function(s) are replaced with the respective semantically-meaningful variable(s). The textual representation of the computer program is caused to be displayed to the user.
-
公开(公告)号:US20230229850A1
公开(公告)日:2023-07-20
申请号:US17576652
申请日:2022-01-14
Applicant: Microsoft Technology Licensing, LLC
Inventor: Mohammad RAZA , Arjun RADHAKRISHNA , José Pablo CAMBRONERO SÁNCHEZ , Sumit GULWANI , John Francis LAM , Vu Minh LE , Daniel MORRIS , Daniel Adam PERELMAN , Daniel Galen SIMMONS , Gustavo ARAUJO SOARES , Ashish TIWARI
IPC: G06F40/154 , G06F9/54 , G06F40/143 , G06N20/00 , G06F40/18 , G06F40/183
CPC classification number: G06F40/154 , G06F9/543 , G06F40/143 , G06N20/00 , G06F40/18 , G06F40/183
Abstract: Pasting content from a clipboard buffer as structured tabular data. A computer system determines a data type of content within a clipboard buffer. Based on the data type of the content, the computer system identifies a tabular pattern analysis technique to apply to the content. Based on applying the tabular pattern analysis technique to the content, the computer system identifies a portion of tabular content within the content. Using a clipboard application programming interface, the computer system presents the portion of tabular content to an application as paste data that is structured as a set of rows and a set of columns.
-
公开(公告)号:US20220317978A1
公开(公告)日:2022-10-06
申请号:US17220131
申请日:2021-04-01
Applicant: Microsoft Technology Licensing, LLC
Inventor: Titus BARIK , Gustavo ARAUJO SOARES , Piyush ARORA , Peter GROENEWEGEN , Sumit GULWANI , Ameya Sanjay KETKAR , Vu Minh LE , Wode NI , David Ellis PUGH , Arjun RADHAKRISHNA , Ivan RADICEK , Ashish TIWARI , Mark Alistair WILSON-THOMAS
Abstract: Edit automation functionality generalizes edits performed by a user in a document, locates similar text, and recommends or applies transforms while staying within a current workflow. Source code edits such as refactoring are automated. The functionality uses or provides anchor target lists, temporal edit patterns, edit graphs, automatable edit sequence libraries, and other data structures and computational techniques for identifying locations appropriate for particular edits, for getting transforms, for selecting optimal transforms, for leveraging transforms in an editing session or later, and for displaying transform recommendations and results. The edit automation functionality enhances automation subtool generation, discoverability, and flexibility, for refactoring, snippet insertion, quick actions in an integrated development environment, and other automatable edit sequences.
-
公开(公告)号:US20210349698A1
公开(公告)日:2021-11-11
申请号:US16869414
申请日:2020-05-07
Applicant: Microsoft Technology Licensing, LLC
Inventor: Mark Alistair WILSON-THOMAS , Vu Minh LE , Sumit GULWANI
Abstract: Flexible yet efficient “find” operations search source code for matches to a general pattern after a developer provides an example string that matches the pattern, without requiring the developer to write a regular expression or script that will implement the desired pattern. Example-driven find-replace functionality uses regular expressions or other pattern match codes, and scripts or other transforms, which are synthesized automatically from examples provided by a developer. This technology allows the developer to focus on workflow inside an integrated development environment instead of breaking focus to search for external documentation, or unfortunately foregoing the flexibility and power of regular expressions and scripts. Synthesizer outputs may be directly or indirectly ranked through user feedback, allowing their refinement. Find match generality may be controlled, e.g., by specifying regex star positions or star counts. Entry of guiding examples may be assisted by autocompletion. Performance criteria are also described.
-
-
-
-
-
-
-
-
-