Abstract:
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for performing speaker verification. A system configured to practice the method receives a request to verify a speaker, generates a text challenge that is unique to the request, and, in response to the request, prompts the speaker to utter the text challenge. The system then records a dynamic image feature of the speaker as the speaker utters the text challenge, and performs speaker verification based on the dynamic image feature and the text challenge. Recording the dynamic image feature of the speaker can include recording video of the speaker while the speaker utters the text challenge. The dynamic image feature can include a movement pattern of the head, lips, mouth, eyes, and/or eyebrows of the speaker. The dynamic image feature can relate to the phonetic content of the speaker speaking the challenge, to speech prosody, and to the speaker's facial expression in response to the content of the challenge.
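The challenge-response flow described above can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the disclosed implementation: the word list, the `ChallengeIssuer` class, the `feature_score` input (standing in for whatever score the dynamic image feature analysis produces), and the 0.8 threshold are all hypothetical.

```python
import secrets

# Hypothetical vocabulary; the abstract does not say how challenges are composed.
WORDS = ["amber", "canyon", "delta", "ember", "falcon", "harbor", "meadow", "orbit"]


class ChallengeIssuer:
    """Issues one-time text challenges and checks responses (illustrative only)."""

    def __init__(self, n_words=4):
        self.n_words = n_words
        self._pending = {}   # request_id -> challenge text awaiting a response
        self._used = set()   # every challenge ever issued, to keep each one unique

    def issue(self, request_id):
        # Draw random phrases until one has never been issued before.
        while True:
            text = " ".join(secrets.choice(WORDS) for _ in range(self.n_words))
            if text not in self._used:
                break
        self._used.add(text)
        self._pending[request_id] = text
        return text

    def verify(self, request_id, recognized_text, feature_score, threshold=0.8):
        # A challenge can be answered only once; pop() removes it from the pending set.
        expected = self._pending.pop(request_id, None)
        if expected is None:
            return False
        # Assumption: recognized_text comes from the recorded utterance, and
        # feature_score (0..1) summarizes how well the dynamic image feature matches.
        text_ok = recognized_text.strip().lower() == expected
        return text_ok and feature_score >= threshold
```

Because each challenge is removed once it has been checked, a recording of an earlier session cannot be replayed against a later request in this sketch.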
Abstract:
An audio conference is supplemented based on speaker content. Speaker content from at least one audio conference participant is monitored using a computer with a tangible non-transitory processor and memory. A set of words is selected from the speaker content. The selected set of words is determined to be associated with supplemental media content from at least one external source. The supplemental media content is made available to at least one audience member for the audio conference. The supplemental media content is selectively presented to the at least one audience member.
Abstract:
A method includes receiving video data and identifying a second object in at least one video frame of the video data. The method also includes determining whether to replace the second object in the at least one video frame with a first object based on at least one object matching rule. In response to determining that the second object is to be replaced with the first object, the method includes manipulating a three-dimensional model of the first object to generate a representation of the first object that matches at least one visual property of the second object, and replacing the second object with the representation of the first object in the at least one video frame.
Abstract:
Speaker content generated in an audio conference is visually represented in accordance with a method. Speaker content from a plurality of audio conference participants is monitored using a computer with a tangible non-transitory processor and memory. The speaker content from each of the plurality of audio conference participants is analyzed. A visual representation of speaker content for each of the plurality of audio conference participants is generated based on the analysis of the speaker content from each of the plurality of audio conference participants. The visual representation of speaker content is displayed.
Abstract:
Telepresence is coordinated among multiple interconnected devices. The presence of a first interconnected device and a second interconnected device in a common space is determined. Multimedia capabilities of the first interconnected device and the second interconnected device are determined. Communications of at least one type of media information using one of the first interconnected device and the second interconnected device are selectively and temporarily enabled by an external controller over a second network. Communications of the at least one type of media information using the other of the first interconnected device and the second interconnected device are selectively and temporarily not enabled by the external controller over the second network.
Abstract:
Methods, systems, and computer programs calibrate a vision system. An image of a human gesture that frames a display device is received. A boundary defined by the human gesture is computed, and a gesture area defined by the boundary is also computed. The gesture area is then mapped to pixels in the display device.
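The mapping step can be illustrated with a minimal sketch. It assumes, purely for illustration, that the gesture boundary is approximated by an axis-aligned bounding box around detected hand points in camera coordinates and that the mapping to display pixels is linear; the function names `bounding_box` and `make_mapper` are hypothetical and do not come from the abstract.

```python
def bounding_box(points):
    """Return (x_min, y_min, x_max, y_max) for a list of (x, y) camera points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return min(xs), min(ys), max(xs), max(ys)


def make_mapper(box, display_w, display_h):
    """Build a function that maps camera coordinates inside `box` to display pixels."""
    x0, y0, x1, y1 = box
    if x1 <= x0 or y1 <= y0:
        raise ValueError("gesture area must have positive width and height")

    def to_pixel(x, y):
        # Normalize to the 0..1 range within the gesture area, then scale to the display.
        u = (x - x0) / (x1 - x0)
        v = (y - y0) / (y1 - y0)
        # Clamp so points on or outside the boundary still land on a valid pixel.
        px = min(display_w - 1, max(0, int(u * display_w)))
        py = min(display_h - 1, max(0, int(v * display_h)))
        return px, py

    return to_pixel


# Example: a gesture framing the display, seen by the camera at these four corners.
corners = [(100, 50), (500, 50), (500, 350), (100, 350)]
to_pixel = make_mapper(bounding_box(corners), 1920, 1080)
print(to_pixel(300, 200))  # center of the gesture area -> (960, 540)
```

A real camera view would usually be skewed, in which case a perspective transform rather than this linear scaling would be the more likely choice.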
Abstract:
A computer-implemented method is disclosed, the method including but not limited to detecting an event of interest in video conference data for a plurality of video conference participants and notifying an end user of the event of interest. A computer-readable medium is also disclosed for containing a computer program for performing the method. A computer-implemented method is also disclosed for receiving, at an end user device, a notification of an event of interest in a video teleconference, the method including but not limited to receiving at the end user device, from a server, a notification indicating a detection of the event of interest in video conference data from the video teleconference for a plurality of video conference participants; and sending data from the end user device to the server requesting a transcription of comments from the speaker in the video teleconference.
Abstract:
A method and apparatus for a service platform capable of providing device-based task completion are disclosed. A request for a task is received at a service platform from a customer. A worker device to complete the task is selected from a group of worker devices registered with the service platform based on a current attribute of the worker device. Data resulting from completion of the task is received from the selected worker device, validated, and presented to the customer. A reward or incentive can be provided to the worker device in response to the data being received from the worker device and validated.
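Selecting a worker device by a current attribute might look like the sketch below. The abstract does not name the attribute or the selection rule, so everything here is an assumption made for illustration: the `WorkerDevice` fields (battery level and distance from the task location), the minimum-battery filter, and the nearest-device rule are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class WorkerDevice:
    device_id: str
    battery: float      # hypothetical current attribute: charge fraction, 0..1
    distance_km: float  # hypothetical current attribute: distance from the task location


def select_worker(devices, min_battery=0.2):
    """Pick the nearest registered device with enough charge, or None if none qualifies."""
    eligible = [d for d in devices if d.battery >= min_battery]
    if not eligible:
        return None
    return min(eligible, key=lambda d: d.distance_km)


registered = [
    WorkerDevice("a", battery=0.10, distance_km=0.5),  # closest, but battery too low
    WorkerDevice("b", battery=0.80, distance_km=2.0),
    WorkerDevice("c", battery=0.95, distance_km=7.5),
]
chosen = select_worker(registered)
print(chosen.device_id if chosen else "no eligible device")  # prints "b"
```

Validation of the returned data and the reward step are left out, since the abstract gives no detail on how either is performed.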