Tokenizers

为研究和生产而优化的快速先进的分词器。「💥 Fast State-of-the-Art Tokenizers optimized for Research and Production」

Main metrics

Overview

Name With Ownerhuggingface/tokenizers
Primary LanguageRust
Program languageRust (Language Count: 8)
Platform
License:Apache License 2.0
Release Count146
Last Release Namev0.21.4 (Posted on )
First Release Namev0.0.3 (Posted on )
Created At2019-11-01 17:52:20
Pushed At2025-07-29 13:32:04
Last Commit At
Stargazers Count9976
Watchers Count125
Fork Count951
Commits Count1886
Has Issues Enabled
Issues Count1086
Issue Open Count79
Pull Requests Count554
Pull Requests Open Count27
Pull Requests Close Count160
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private
To the top