LLM

Large Language Model for Long Genome Sequence

Develop a genome LLM model based on BERT architecture and adopt masked language modeling as pretraining strategy.

Jun 1, 2024

PyTorch DistributedDataParallel Training

In this blog post, we will explore the concept of distributed training and delve into the details of PyTorch’s DistributedDataParallel training approach. Some Prerequisite Definitions Process: A process is the basic unit of work in an operating system.

Jan 1, 2024

The Transformer Explained

Transformer architecture explained with minimal PyTorch implementation line-by-line.

Oct 26, 2023