LBANN：利弗莫尔大型人工神经网络工具包

利弗莫尔大型人工神经网络工具包（LBANN）是一个开源的、以 HPC 为中心的深度学习训练框架，它被优化为多层次的并行性组合。

LBANN 通过领域分解提供模型并行加速，以优化网络训练的强大扩展性。它还允许将模型并行性与数据并行性和集合训练方法结合起来，用大量的数据训练大型神经网络。LBANN 能够利用紧密耦合的加速器、低延迟高带宽网络和高带宽并行文件系统的优势。

除了传统的监督学习之外，LBANN 还支持最先进的训练算法，如无监督、自监督和对抗性（GAN）训练方法。它还支持通过时间反向传播（BPTT）训练的递归神经网络、转移学习以及多模型和集合训练方法。

运行 LBANN

运行 LBANN 的基本模板是：

<mpi-launcher> <mpi-options> \ lbann <lbann-options> \ --model=model.prototext \ --optimizer=opt.prototext \ --reader=data_reader.prototext

当使用GPGPU加速器时，用户应该注意LBANN是针对每个MPI等级分配一个GPU的情况而优化的。在选择MPI启动器的参数时，应牢记这一点。

关于运行 LBANN 的更多细节记录在此。

Name With Owner	LBANN/lbann
Primary Language	C++
Program language	CMake (Language Count: 4)
Platform	Linux, Mac
License:	Other

Name With Owner

LBANN/lbann

Primary Language

C++

Program language

CMake (Language Count: 4)

Platform

Linux, Mac

License:

Other

Created At	2016-05-11 20:04:20
Pushed At	2025-05-09 20:17:29
Last Commit At
Release Count	25
Last Release Name	2024_10_17_v0.105_pre_release (Posted on 2024-10-17 17:50:53)
First Release Name	v0.9 (Posted on 2016-07-19 13:40:23)

Created At

2016-05-11 20:04:20

Pushed At

2025-05-09 20:17:29

Last Commit At

Release Count

Last Release Name

2024_10_17_v0.105_pre_release (Posted on 2024-10-17 17:50:53)

First Release Name

v0.9 (Posted on 2016-07-19 13:40:23)

Stargazers Count	229
Watchers Count	24
Fork Count	79
Commits Count	1
Has Issues Enabled
Issues Count	473
Issue Open Count	167
Pull Requests Count	1807
Pull Requests Open Count	39
Pull Requests Close Count	153

Stargazers Count

229

Watchers Count

Fork Count

Commits Count

Has Issues Enabled

Issues Count

473

Issue Open Count

167

Pull Requests Count

1807

Pull Requests Open Count

Pull Requests Close Count

153

Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private

Has Wiki Enabled

Is Archived

Is Fork

Is Locked

Is Mirror

Is Private

LBANN: Livermore Big Artificial Neural Network Toolkit

The Livermore Big Artificial Neural Network toolkit (LBANN) is an
open-source, HPC-centric, deep learning training framework that is
optimized to compose multiple levels of parallelism.

LBANN provides model-parallel acceleration through domain
decomposition to optimize for strong scaling of network training. It
also allows for composition of model-parallelism with both data
parallelism and ensemble training methods for training large neural
networks with massive amounts of data. LBANN is able to advantage of
tightly-coupled accelerators, low-latency high-bandwidth networking,
and high-bandwidth parallel file systems.

LBANN supports state-of-the-art training algorithms such as
unsupervised, self-supervised, and adversarial (GAN) training methods
in addition to traditional supervised learning. It also supports
recurrent neural networks via back propagation through time (BPTT)
training, transfer learning, and multi-model and ensemble training
methods.

Building LBANN

The preferred method for LBANN users to install LBANN is to use
Spack. After some system
configuration, this should be as straightforward as

spack install lbann

More detailed instructions for building and installing LBANN are
available at the main LBANN
documentation.

Running LBANN

The basic template for running LBANN is

<mpi-launcher> <mpi-options> \
    lbann <lbann-options> \
    --model=model.prototext \
    --optimizer=opt.prototext \
    --reader=data_reader.prototext

When using GPGPU accelerators, users should be aware that LBANN is
optimized for the case in which one assigns one GPU per MPI
rank. This should be borne in mind when choosing the parameters for
the MPI launcher.

More details about running LBANN are documented
here.

Publications

A list of publications, presentations and posters are shown
here.

Reporting issues

Issues, questions, and bugs can be raised on the Github issue
tracker.

LBANN

Github stars Tracking Chart