Probabilistic neural network architecture generation
摘要:
Examples of the present disclosure describe systems and methods for probabilistic neural network architecture generation. In an example, an underlying distribution over neural network architectures based on various parameters is sampled using probabilistic modeling. Training data is evaluated in order to iteratively update the underlying distribution, thereby generating a probability distribution over the neural network architectures. The distribution is iteratively trained until the parameters associated with the neural network architecture converge. Once it is determined that the parameters have converged, the resulting probability distribution may be used to generate a resulting neural network architecture. As a result, intermediate architectures need not be fully trained, which dramatically reduces memory usage and/or processing time. Further, in some instances, it is possible to evaluate bigger architectures and/or larger batch sizes while also reducing neural network architecture generation time and maintaining or improving neural network accuracy.
公开/授权文献
信息查询
0/0