OPTIMIZING LARGE LANGUAGE MODELS WITH DOMAIN-ORIENTED MODEL COMPRESSION

    公开(公告)号:US20250061334A1

    公开(公告)日:2025-02-20

    申请号:US18805978

    申请日:2024-08-15

    Abstract: Systems and methods for optimizing large language models (LLM) with domain-oriented model compression. Importance weights for general knowledge in a trained LLM, pretrained with deep learning, can be determined by computing the error when removing a weight from the trained LLM. The trained LLM can be iteratively optimized to obtain a domain-compressed LLM with domain knowledge while maintaining general knowledge by: fine-tuning the trained LLM iteratively with domain knowledge using the importance weights for general knowledge to obtain a fine-tuned LLM; determining importance weights for domain knowledge in the LLM with a regularization term by using gradient descent to optimize parameters when the fine-tuned LLM is trained with domain knowledge; and pruning learned knowledge based on importance weights for domain knowledge. A corrective action can be performed on a monitored entity using the domain-compressed LLM.

    Long distance distributed fiber optic sensing (DFOS) and WDM communications over repeated fiber spans using all-raman amplification and coding schemes

    公开(公告)号:US12206489B2

    公开(公告)日:2025-01-21

    申请号:US18109247

    申请日:2023-02-13

    Abstract: Aspects of the present disclosure describe distributed fiber optic sensing (DFOS) systems, methods, and structures that advantageously provide DFOS and WDM communications over amplified, multi-span optical WDM optical telecommunications facilities using all Raman amplification and coding schemes. Our all-Raman amplification operates stably—without isolators—and provides sufficient gain to compensate for fiber span loss for both DFOS signals and WDM channel signals—at the same time. Furthermore, our inventive techniques employ signal coding, such as MB-TGD-OFDR for DAS, and we operate our DFOS operation power at a much lower power level as compared to pulse interrogation techniques. With improved OSNR and reduced power using signal coding along with our distributed Raman amplification, our DFOS systems can co-exist with WDM communication channels on the same amplified multi-span fiber optic links over great distances.

    Learning ordinal representations for deep reinforcement learning based object localization

    公开(公告)号:US12205357B2

    公开(公告)日:2025-01-21

    申请号:US17715901

    申请日:2022-04-07

    Abstract: A reinforcement learning based approach to the problem of query object localization, where an agent is trained to localize objects of interest specified by a small exemplary set. We learn a transferable reward signal formulated using the exemplary set by ordinal metric learning. It enables test-time policy adaptation to new environments where the reward signals are not readily available, and thus outperforms fine-tuning approaches that are limited to annotated images. In addition, the transferable reward allows repurposing of the trained agent for new tasks, such as annotation refinement, or selective localization from multiple common objects across a set of images. Experiments on corrupted MNIST dataset and CU-Birds dataset demonstrate the effectiveness of our approach.

    Multi-hop evidence pursuit
    515.
    发明授权

    公开(公告)号:US12205027B2

    公开(公告)日:2025-01-21

    申请号:US17840987

    申请日:2022-06-15

    Abstract: A method for neural network training is provided. The method inputs a training set of textual claims, lists of evidence including gold evidence chains, and claim labels labelling the evidence with respect to the textual claims. The claim labels include refutes, supports, and not enough information (NEI). The method computes an initial set of document retrievals for each of the textual claims. The method also includes computing an initial set of page element retrievals including sentence retrievals from the initial set of document retrievals for each of the textual claims. The method creates, from the training set of textual claims, a Leave Out Training Set which includes input texts and target texts relating to the labels. The method trains a sequence-to-sequence neural network to generate new target texts from new input texts using the Leave Out Training Set.

    ENCODING AND DECODING IMAGES USING DIFFERENTIABLE JPEG COMPRESSION

    公开(公告)号:US20250008132A1

    公开(公告)日:2025-01-02

    申请号:US18755150

    申请日:2024-06-26

    Abstract: Systems and methods are provided for encoding and decoding images using differentiable JPEG compression, including converting images from RGB color space to YCbCr color space to obtain a luminance and chrominance channels, and applying chroma subsampling to the chrominance channels to reduce resolution. The YCbCr image is divided into pixel blocks and a DCT is performed on the pixel blocks to obtain DCT coefficients. DCT coefficients are quantized using a scaled quantization table to reduce precision, and quantized DCT coefficients are encoded using lossless entropy coding, forming a compressed JPEG file decoded by reversing the lossless entropy coding to obtain quantized DCT coefficients, which are dequantized using the scaled quantization table to restore the precision. The dequantized DCT coefficients are converted back to a spatial domain using an IDCT, the chrominance channels are upsampled to original resolution, and the YCbCr image is converted back to the RGB color space.

    Dynamic anomaly localization of utility pole wires

    公开(公告)号:US12160090B2

    公开(公告)日:2024-12-03

    申请号:US18492902

    申请日:2023-10-24

    Abstract: Systems and methods for performing the dynamic anomaly localization of utility pole aerial/suspended/supported wires/cables by distributed fiber optic sensing. In sharp contrast to the prior art, our inventive systems and methods according to aspects of the present disclosure advantageously identify a “location region” on a utility pole supporting an affected wire/cable, thereby permitting the identification and reporting of service personnel that are uniquely responsible for responding to such anomalous condition(s).

    WEIGHT ATTENTION FOR TRANSFORMERS IN MEDICAL DECISION MAKING MODELS

    公开(公告)号:US20240394524A1

    公开(公告)日:2024-11-28

    申请号:US18670275

    申请日:2024-05-21

    Inventor: Iain Melvin

    Abstract: Methods and systems for configuring a machine learning model include selecting a head from a set of stored heads, responsive to an input, to implement a layer in a transformer machine learning model. The selected head is copied from persistent storage to active memory. The layer in the transformer machine learning model is executed on the input using the selected head to generate an output. An action is performed responsive to the output.

    FEDERATED IMITATION LEARNING FOR MEDICAL DECISION MAKING

    公开(公告)号:US20240371521A1

    公开(公告)日:2024-11-07

    申请号:US18649072

    申请日:2024-04-29

    Abstract: Methods and systems for skill prediction include aggregating locally trained parameters from client systems to generate updated global parameters. Parameterized vectors from the client systems are clustered into prototype clusters. A centroid of each prototype cluster is determined and the parameterized vectors from the client systems are matched to centroids of the prototype clusters to identify sets of updated local prototype vectors. The updated global parameters and the updated local prototype vectors are distributed to the client systems.

    Dynamic, contextualized AI models
    520.
    发明授权

    公开(公告)号:US12136255B2

    公开(公告)日:2024-11-05

    申请号:US17577664

    申请日:2022-01-18

    Abstract: A method for employing a semi-supervised learning approach to improve accuracy of a small model on an edge device is presented. The method includes collecting a plurality of frames from a plurality of video streams generated from a plurality of cameras, each camera associated with a respective small model, each small model deployed in the edge device, sampling the plurality of frames to define sampled frames, performing inference to the sampled frames by using a big model, the big model shared by all of the plurality of cameras and deployed in a cloud or cloud edge, using the big model to generate labels for each of the sampled frames to generate training data, and training each of the small models with the training data to generate updated small models on the edge device.

Patent Agency Ranking