DATA LABELING METHOD BASED ON ARTIFICIAL INTELLIGENCE, APPARATUS AND STORAGE MEDIUM

    Publication No.: US20230316709A1

    Publication date: 2023-10-05

    Application No.: US17902323

    Filing date: 2022-09-02

    CPC classification number: G06V10/762 G06V10/764 G06V10/761 G06F16/285

    Abstract: Provided are a data labeling method based on artificial intelligence, an apparatus, and a storage medium, relating to the field of artificial intelligence and in particular to data labeling, image recognition, and natural language processing. The method includes: determining a plurality of samples to be clustered; iteratively performing the following operations until a convergence condition is satisfied or the number of iterations reaches a threshold: pre-clustering the plurality of samples according to a vector representation of each sample to obtain a plurality of class clusters, each class cluster containing at least one sample, and receiving labeling information for each class cluster and re-determining the plurality of samples according to the labeling information; and determining a clustering result according to the labeling information for the respective class clusters.
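    The iterative loop described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the abstract does not name a clustering algorithm, so a simple greedy "leader" clustering with a shrinking radius is swapped in; the `ask_label` callback is a hypothetical stand-in for the human labeler; a rejected cluster (label `None`) is assumed to send its samples back for re-clustering; and "convergence" is assumed to mean that no unlabeled samples remain.

```python
from math import dist
from typing import Callable, Dict, List, Optional, Sequence

Vector = Sequence[float]


def pre_cluster(ids: List[int], vectors: Sequence[Vector], radius: float) -> List[List[int]]:
    """Greedy leader clustering (an assumed stand-in for the pre-clustering step).

    A sample joins the first cluster whose leader (first member) lies within
    `radius`; otherwise it starts a new cluster. Every cluster has >= 1 sample.
    """
    clusters: List[List[int]] = []
    for i in ids:
        for cluster in clusters:
            if dist(vectors[i], vectors[cluster[0]]) <= radius:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def label_iteratively(
    vectors: Sequence[Vector],
    ask_label: Callable[[List[int]], Optional[str]],  # hypothetical labeler callback
    radius: float = 1.0,
    shrink: float = 0.5,   # assumed: tighten clusters each round
    max_iters: int = 5,    # the "number threshold" on iterations
) -> Dict[int, str]:
    """Return a mapping from sample index to the label accepted for its cluster."""
    pending = list(range(len(vectors)))  # samples still involved in clustering
    result: Dict[int, str] = {}
    for _ in range(max_iters):
        if not pending:  # assumed convergence condition: nothing left to label
            break
        rejected: List[int] = []
        for cluster in pre_cluster(pending, vectors, radius):
            label = ask_label(cluster)
            if label is None:
                rejected.extend(cluster)  # re-determine samples for the next round
            else:
                for i in cluster:
                    result[i] = label
        pending = rejected
        radius *= shrink
    return result
```

    In this sketch the labeler sees one cluster at a time rather than one sample at a time, which is the apparent point of pre-clustering: a single accept decision labels every sample in the cluster, and only mixed clusters are re-clustered at a finer granularity. Samples still rejected when `max_iters` is reached are simply absent from the returned mapping.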
