专利检索 ap:("International Business Machines Corporation") AND inv:"Mal Pattiarachi" 第 1 页

1.

发明授权
Creating synthetic visual inspection data sets using augmented reality 有权

公开(公告)号：US11868444B2

公开(公告)日：2024-01-09

申请号：US17380075

申请日：2021-07-20

申请人： International Business Machines Corporation

发明人： Michael Charles Hollinger , Mal Pattiarachi , Abhinav Pratap Singh

IPC分类号： G06F18/40 , G06T19/00 , G06T19/20 , G06N20/00 , G06V20/20 , G06F18/214 , G06F18/21

CPC分类号： G06F18/40 , G06F18/214 , G06F18/217 , G06N20/00 , G06T19/006 , G06T19/20 , G06V20/20 , G06T2200/24 , G06T2219/2016

摘要： In an approach for creating synthetic visual inspection data sets for training an artificial intelligence computer vision deep learning model utilizing augmented reality, a processor enables a user to capture a plurality of images of an anchor object using a camera on a user computing device. A processor receives the plurality of images of the anchor object from the user. A processor generates a baseline model of an anchor object. A processor generates a training data set. A processor trains the baseline model of the anchor object. A processor creates a trained Artificial Intelligence (AI) computer vision deep learning model. A processor enables the user to interact with the trained AI computer vision deep learning model in an access mode.

2.

发明申请
Prospective Voice User Interface Modality Identification 审中-公开

公开(公告)号：US20190121619A1

公开(公告)日：2019-04-25

申请号：US15790158

申请日：2017-10-23

申请人： International Business Machines Corporation

发明人： Blaine H. Dolph , David M. Lubensky , Mal Pattiarachi , Marco Pistoia , Nitendra Rajput , Justin Weisz

IPC分类号： G06F9/44 , G06F3/0484 , G06K9/00

摘要： Techniques are disclosed for identifying which graphical user interface (GUI) screens of an application that is under development would benefit from a voice user interface (VUI). A GUI screen parser analyzes to determine the GUI objects within GUI screens of the application. The parser assigns a speechability score to each analyzed GUI screen. Those GUI screens that have a higher speechability score than a predetermined speechability threshold are indicated as GUI screens that would benefit (e.g., the user experience in interacting with those GUI screens would increase, the number of GUI screens displayed would be reduced, or the like) with the addition of a VUI.

3.

发明授权
Training data optimization in a service computing system for voice enablement of applications 有权

公开(公告)号：US10565982B2

公开(公告)日：2020-02-18

申请号：US15808169

申请日：2017-11-09

申请人： International Business Machines Corporation

发明人： Blaine H. Dolph , David M. Lubensky , Mal Pattiarachi , Marcus D. Roy , Justin Weisz

IPC分类号： G10L15/01 , G10L15/06 , G06F3/16 , G10L13/04 , G10L15/26 , G06N20/00

摘要： Techniques for optimizing training data within voice user interface (VUI) of an application under development are disclosed. A VUI feedback module synthesizes human speech of a training phrase. This phrase is presented upon a speaker which is simultaneously captured upon a microphone. A speech to text framework converts the synthesized training phrase into text (textualized training phrase). The VUI feedback module compares the textualized training phrase to the actual training phrase and generates a speech training data structure that identifies similarities or dissimilarities between the textualized training phrase and the actual training phrase. This data structure may be utilized by an application developer computing system to identify training data that is most venerable to misinterpretation when a user interacts with the VUI. The VUI may subsequently be adjusted to account for the vulnerabilities to improve operations or user experience of the VUI.

4.

发明申请
Training Data Optimization in a Service Computing System for Voice Enablement of Applications 审中-公开

公开(公告)号：US20190138270A1

公开(公告)日：2019-05-09

申请号：US15808169

申请日：2017-11-09

申请人： International Business Machines Corporation

发明人： Blaine H. Dolph , David M. Lubensky , Mal Pattiarachi , Marcus D. Roy , Justin Weisz

IPC分类号： G06F3/16 , G06N99/00 , G10L15/26 , G10L13/04 , G10L15/06

摘要： Techniques for optimizing training data within voice user interface (VUI) of an application under development are disclosed. A VUI feedback module synthesizes human speech of a training phrase. This phrase is presented upon a speaker which is simultaneously captured upon a microphone. A speech to text framework converts the synthesized training phrase into text (textualized training phrase). The VUI feedback module compares the textualized training phrase to the actual training phrase and generates a speech training data structure that identifies similarities or dissimilarities between the textualized training phrase and the actual training phrase. This data structure may be utilized by an application developer computing system to identify training data that is most venerable to misinterpretation when a user interacts with the VUI. The VUI may subsequently be adjusted to account for the vulnerabilities to improve operations or user experience of the VUI.

5.

发明申请
CREATING SYNTHETIC VISUAL INSPECTION DATA SETS USING AUGMENTED REALITY 有权

公开(公告)号：US20230027216A1

公开(公告)日：2023-01-26

申请号：US17380075

申请日：2021-07-20

申请人： International Business Machines Corporation

发明人： Michael Charles Hollinger , Mal Pattiarachi , ABHINAV PRATAP SINGH

IPC分类号： G06K9/62 , G06T19/00 , G06T19/20 , G06K9/00 , G06N20/00

摘要： In an approach for creating synthetic visual inspection data sets for training an artificial intelligence computer vision deep learning model utilizing augmented reality, a processor enables a user to capture a plurality of images of an anchor object using a camera on a user computing device. A processor receives the plurality of images of the anchor object from the user. A processor generates a baseline model of an anchor object. A processor generates a training data set. A processor trains the baseline model of the anchor object. A processor creates a trained Artificial Intelligence (AI) computer vision deep learning model. A processor enables the user to interact with the trained AI computer vision deep learning model in an access mode.

6.

发明授权
Automated voice enablement of applications 有权

公开(公告)号：US10585640B2

公开(公告)日：2020-03-10

申请号：US15790160

申请日：2017-10-23

申请人： International Business Machines Corporation

发明人： Blaine H. Dolph , David M. Lubensky , Mal Pattiarachi , Marco Pistoia , Nitendra Rajput , Justin Weisz

IPC分类号： G06F3/048 , G06F3/16 , G06F17/27 , G06F8/38 , G06F3/0484

摘要： Techniques are disclosed for generating a voice user interface (VUI) modality within an application that includes graphical user interface (GUI) screens. A GUI screen parser analyzes the GUI screens to determine the various navigational GUI screen paths that are associated with edge objects within multiple GUI screens. Some edge objects are identified as select objects or prompt objects. A natural language processing system generates a select object synonym data structure and a prompt object data structure that may be utilized by a VUI generator to generate VUI data structures that give the application VUI modality.

7.

发明授权
Automated voice enablement of applications 有权

公开(公告)号：US10481865B2

公开(公告)日：2019-11-19

申请号：US15790162

申请日：2017-10-23

申请人： International Business Machines Corporation

发明人： Blaine H. Dolph , David M. Lubensky , Mal Pattiarachi , Marco Pistoia , Nitendra Rajput , Justin Weisz

IPC分类号： G06F3/048 , G06F3/16 , G06F3/0484 , G06F17/27 , G06F8/38 , G06F17/28 , G06F9/451

摘要： Techniques are disclosed for generating a voice user interface (VUI) modality within an application that includes graphical user interface (GUI) screens. A GUI screen parser analyzes the GUI screens to determine the various navigational GUI screen paths that are associated with edge objects within multiple GUI screens. Some edge objects are identified as select objects or prompt objects. A natural language processing system generates a select object synonym data structure and a prompt object data structure that may be utilized by a VUI generator to generate VUI data structures that give the application VUI modality.

8.

发明申请
Automated Voice Enablement of Applications 审中-公开

公开(公告)号：US20190121609A1

公开(公告)日：2019-04-25

申请号：US15790162

申请日：2017-10-23

申请人： International Business Machines Corporation

发明人： Blaine H. Dolph , David M. Lubensky , Mal Pattiarachi , Marco Pistoia , Nitendra Rajput , Justin Weisz

IPC分类号： G06F3/16 , G06F3/0484 , G06F9/44

摘要： Techniques are disclosed for generating a voice user interface (VUI) modality within an application that includes graphical user interface (GUI) screens. A GUI screen parser analyzes the GUI screens to determine the various navigational GUI screen paths that are associated with edge objects within multiple GUI screens. Some edge objects are identified as select objects or prompt objects. A natural language processing system generates a select object synonym data structure and a prompt object data structure that may be utilized by a VUI generator to generate VUI data structures that give the application VUI modality.

9.

发明授权
Training data optimization for voice enablement of applications 有权

公开(公告)号：US10553203B2

公开(公告)日：2020-02-04

申请号：US15807956

申请日：2017-11-09

申请人： International Business Machines Corporation

发明人： Blaine H. Dolph , David M. Lubensky , Mal Pattiarachi , Marcus D. Roy , Justin Weisz

IPC分类号： G10L15/01 , G10L15/06 , G06F3/16 , G10L13/04 , G10L15/26 , G06N20/00

摘要： Techniques for optimizing training data within voice user interface (VUI) of an application under development are disclosed. A VUI feedback module synthesizes human speech of a training phrase. This phrase is presented upon a speaker which is simultaneously captured upon a microphone. A speech to text framework converts the synthesized training phrase into text (textualized training phrase). The VUI feedback module compares the textualized training phrase to the actual training phrase and generates a speech training data structure that identifies similarities or dissimilarities between the textualized training phrase and the actual training phrase. This data structure may be utilized by an application developer computing system to identify training data that is most venerable to misinterpretation when a user interacts with the VUI. The VUI may subsequently be adjusted to account for the vulnerabilities to improve operations or user experience of the VUI.

10.

发明申请
Training Data Optimization for Voice Enablement of Applications 审中-公开

公开(公告)号：US20190138269A1

公开(公告)日：2019-05-09

申请号：US15807956

申请日：2017-11-09

申请人： International Business Machines Corporation

发明人： Blaine H. Dolph , David M. Lubensky , Mal Pattiarachi , Marcus D. Roy , Justin Weisz

IPC分类号： G06F3/16 , G06N99/00 , G10L15/26 , G10L13/04 , G10L15/06

摘要： Techniques for optimizing training data within voice user interface (VUI) of an application under development are disclosed. A VUI feedback module synthesizes human speech of a training phrase. This phrase is presented upon a speaker which is simultaneously captured upon a microphone. A speech to text framework converts the synthesized training phrase into text (textualized training phrase). The VUI feedback module compares the textualized training phrase to the actual training phrase and generates a speech training data structure that identifies similarities or dissimilarities between the textualized training phrase and the actual training phrase. This data structure may be utilized by an application developer computing system to identify training data that is most venerable to misinterpretation when a user interacts with the VUI. The VUI may subsequently be adjusted to account for the vulnerabilities to improve operations or user experience of the VUI.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类