VIDEO GENERATION METHOD AND SYSTEM FOR HIGH RESOLUTION FACE SWAPPING

    Publication No.: US20230112462A1

    Publication Date: 2023-04-13

    Application No.: US17623247

    Filing Date: 2021-08-09

    Abstract: A video generation method includes: obtaining a target face image and a source face image; extracting features of each of the source face image and the target face image through a face feature encoder, to obtain corresponding source feature codes and target feature codes; generating swapped face feature codes through a face feature exchanger according to the source feature codes and the target feature codes; generating an initial swapped face image through a face generator according to the swapped face feature codes; and fusing the initial swapped face image with the target face image through a face fuser, to obtain a final swapped face image. The face feature encoder performs hierarchical encoding on the face features to preserve the semantic details of a face, and the face feature exchanger performs further processing based on the hierarchical encoding, to obtain a hierarchical encoding of the swapped face features with semantic details.

    SIGNAL AMPLITUDE FEATURE-BASED METHOD FOR FAST RECONSTRUCTING A MAGNETIC PARTICLE IMAGING AND DEVICE

    Publication No.: US20230027988A1

    Publication Date: 2023-01-26

    Application No.: US17811738

    Filing Date: 2022-07-11

    Abstract: The present disclosure includes: transforming a time-domain voltage signal collected by a magnetic particle imaging (MPI) system device to the frequency domain; calculating the amplitude at each frequency point of the frequency-domain signal as the square root of the sum of squares of its real and imaginary parts; arranging the acquired amplitudes in descending order and acquiring a screening threshold by an amplitude ratio method; screening the amplitudes against the screening threshold and constructing frequency-domain signal data; acquiring the row vector of the system matrix corresponding to each frequency point of the data, so as to construct an updated system matrix; and solving, based on the frequency-domain signal data and the updated system matrix, an inverse problem in least-squares form with an L2 constraint to obtain a three-dimensional magnetic particle concentration distribution, so as to achieve fast reconstruction for the MPI system.
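
The steps in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not define the "amplitude ratio method", so the threshold here is assumed to be a fixed fraction (`ratio`) of the largest amplitude; the L2-constrained least-squares problem is solved as a standard Tikhonov-regularized normal equation; and the concentration is returned as a flat vector rather than a 3D volume. The function names and the parameters `ratio` and `lam` are hypothetical.

```python
import numpy as np

def screen_frequencies(u_time, ratio=0.1):
    """Return the indices of the frequency points kept by amplitude screening,
    together with the full one-sided spectrum."""
    spectrum = np.fft.rfft(u_time)                      # time domain -> frequency domain
    amplitude = np.sqrt(spectrum.real**2 + spectrum.imag**2)
    order = np.argsort(amplitude)[::-1]                 # amplitudes in descending order
    threshold = ratio * amplitude[order[0]]             # ASSUMED threshold rule
    keep = np.sort(order[amplitude[order] >= threshold])
    return keep, spectrum

def reconstruct(u_time, system_matrix, ratio=0.1, lam=1e-3):
    """Solve min_c ||S_k c - u_k||^2 + lam * ||c||^2 on the screened rows only."""
    keep, spectrum = screen_frequencies(u_time, ratio)
    u_k = spectrum[keep]                                # screened frequency-domain data
    S_k = system_matrix[keep, :]                        # matching rows of the system matrix
    n_voxels = S_k.shape[1]
    lhs = S_k.conj().T @ S_k + lam * np.eye(n_voxels)
    rhs = S_k.conj().T @ u_k
    c = np.linalg.solve(lhs, rhs)
    return c.real                                       # concentrations are real-valued
```

Here `system_matrix` is expected to have one row per one-sided frequency point (`len(u_time) // 2 + 1` rows) and one column per voxel. Dropping low-amplitude rows shrinks the linear system, which is where the speed-up described in the abstract would come from.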

    Method for obtaining digital audio tampering evidence based on phase deviation detection

    Publication No.: US11521629B1

    Publication Date: 2022-12-06

    Application No.: US17668104

    Filing Date: 2022-02-09

    Abstract: Disclosed is a digital audio tampering forensics method based on phase offset detection, comprising: multiplying a signal to be identified by a time label to obtain a modulation signal of the signal to be identified; then performing a short-time Fourier transform on the signal to be identified and on the modulation signal to obtain a signal power spectrum and a modulation signal power spectrum; computing group delay characteristics by using the signal power spectrum and the modulation signal power spectrum; computing a mean value of the group delay characteristics, and then smoothing the mean values to obtain phase information of the current frame signal; and computing a dynamic threshold from the phase information of the current frame signal, and then deciding whether the signal has been tampered with by using the dynamic threshold and the phase information of the current frame signal.
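
The group-delay step can be illustrated with a well-known identity: if X is the Fourier transform of x[n] and Y is the Fourier transform of n·x[n], then the group delay equals (Re X · Re Y + Im X · Im Y) / |X|². The sketch below applies that identity frame by frame and averages over frequency. It is a hedged illustration only: the abstract does not give its exact formula, so the use of this identity, the frame-local time index, the rectangular window, and the names `frame_group_delay` and `mean_group_delay_per_frame` are all assumptions. The smoothing and dynamic-threshold steps are left out because the abstract does not specify them.

```python
import numpy as np

def frame_group_delay(frame, eps=1e-12):
    """Group delay of one frame at every frequency bin, without phase unwrapping."""
    frame = np.asarray(frame, dtype=float)
    n = np.arange(len(frame))            # ASSUMED frame-local "time label"
    X = np.fft.rfft(frame)               # spectrum of the signal
    Y = np.fft.rfft(n * frame)           # spectrum of the time-modulated signal
    # eps guards against division by zero in silent bins
    return (X.real * Y.real + X.imag * Y.imag) / (np.abs(X) ** 2 + eps)

def mean_group_delay_per_frame(signal, frame_len=256, hop=128):
    """Mean group delay of each short-time frame (rectangular window assumed)."""
    signal = np.asarray(signal, dtype=float)
    starts = range(0, len(signal) - frame_len + 1, hop)
    return np.array([frame_group_delay(signal[s:s + frame_len]).mean()
                     for s in starts])
```

A unit impulse placed at sample 5 of a frame yields a group delay of 5 at every bin, which makes a convenient sanity check. An abrupt jump in the per-frame mean is the kind of phase discontinuity a splice detector would look for.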

    System for speech recognition text enhancement fusing multi-modal semantic invariance

    Publication No.: US11488586B1

    Publication Date: 2022-11-01

    Application No.: US17867937

    Filing Date: 2022-07-19

    Abstract: Disclosed is a system for speech recognition text enhancement fusing multi-modal semantic invariance. The system includes an acoustic feature extraction module, an acoustic down-sampling module, an encoder, and a decoder fusing multi-modal semantic invariance. The acoustic feature extraction module divides speech data into short-term audio frames of fixed length, extracts fbank acoustic features from the frames, and inputs the acoustic features into the acoustic down-sampling module, which down-samples them to obtain an acoustic representation. The speech data is also input into an existing speech recognition module to obtain input text data, which is input into the encoder to obtain an input text encoded representation. The acoustic representation and the input text encoded representation are then input into the decoder to be fused.
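
The front end described here (fixed-length framing, per-frame features, then down-sampling along time) might look roughly like the sketch below. It is illustrative only and makes several assumptions not found in the abstract: a plain magnitude spectrum stands in for the acoustic features, down-sampling is done by averaging groups of consecutive frames rather than by whatever learned layer the patent uses, and all function names and parameters are hypothetical.

```python
import numpy as np

def split_into_frames(samples, frame_len, hop):
    """Cut a 1-D signal into fixed-length, possibly overlapping frames."""
    samples = np.asarray(samples, dtype=float)
    n_frames = 1 + (len(samples) - frame_len) // hop
    return np.stack([samples[i * hop:i * hop + frame_len]
                     for i in range(n_frames)])

def magnitude_spectrum(frames):
    """Stand-in acoustic feature: magnitude spectrum of every frame."""
    return np.abs(np.fft.rfft(frames, axis=1))

def downsample(features, factor):
    """Shorten the time axis by averaging each group of `factor` frames.
    Trailing frames that do not fill a whole group are dropped."""
    n = (len(features) // factor) * factor
    return features[:n].reshape(-1, factor, features.shape[1]).mean(axis=1)
```

For example, `downsample(magnitude_spectrum(split_into_frames(x, 400, 160)), 4)` would turn 25 ms frames with a 10 ms hop at 16 kHz into one vector per 40 ms, a shorter sequence that is cheaper for a decoder to attend over alongside the text encoding.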

    End-to-end system for speech recognition and speech translation and device

    Publication No.: US11475877B1

    Publication Date: 2022-10-18

    Application No.: US17852140

    Filing Date: 2022-06-28

    Abstract: Disclosed are an end-to-end system for speech recognition and speech translation and an electronic device. The system comprises an acoustic encoder, a multi-task decoder, and a semantic invariance constraint module, and performs both speech recognition and speech translation. In addition, because the texts produced by the different tasks are semantically consistent, semantic constraints are imposed on the model so that it learns high-level semantic information, which can effectively improve the performance of speech recognition and speech translation. The application avoids the error accumulation problem of serial systems, has a low computational cost, and offers high real-time performance.

    METHOD FOR GENERATING TRANSCRANIAL MAGNETIC STIMULATION (TMS) COIL POSE ATLAS BASED ON ELECTROMAGNETIC SIMULATING CALCULATION

    Publication No.: US20220218220A1

    Publication Date: 2022-07-14

    Application No.: US17539150

    Filing Date: 2021-11-30

    Abstract: A method for generating a transcranial magnetic stimulation (TMS) coil pose atlas based on electromagnetic simulating calculation includes: constructing coil array positions and orientations of a scalp in a standard Montreal Neurological Institute (MNI) space and matching the coil array positions and orientations of the scalp to a brain space of an individual to obtain coil array positions and orientations of the brain space of the individual; using a finite element calculation method to simulate the coil array positions of the brain space of the individual to obtain induced electric field distributions of brain tissue in different coil orientations; obtaining optimal regulation effects based on the induced electric field distributions of the brain tissue; and obtaining a coil position and orientation corresponding to each optimal regulation effect as an optimal coil pose of each divided brain area of the individual, and constructing a TMS coil pose atlas of the individual.
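
The final selection step, choosing one coil pose per brain area from the simulated fields, can be sketched as a simple arg-max over poses. This is a hypothetical illustration: the abstract does not define how the "optimal regulation effect" is scored, so the mean induced electric-field magnitude inside each area is assumed as the score here, and the finite element simulation itself is taken as already done. The name `build_pose_atlas` and its argument layout are assumptions.

```python
import numpy as np

def build_pose_atlas(efield, area_labels, poses):
    """Pick, for every brain area, the simulated coil pose with the best score.

    efield      : (n_poses, n_elements) induced E-field magnitude per mesh element
    area_labels : (n_elements,) integer brain-area label of each element
    poses       : sequence of n_poses pose descriptors (position and orientation)
    """
    efield = np.asarray(efield, dtype=float)
    area_labels = np.asarray(area_labels)
    atlas = {}
    for area in np.unique(area_labels):
        in_area = area_labels == area
        score = efield[:, in_area].mean(axis=1)   # ASSUMED effect metric
        atlas[int(area)] = poses[int(np.argmax(score))]
    return atlas
```

The returned dictionary maps each area label to its selected pose, which corresponds to the per-individual atlas the abstract describes.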
