fst

用有限状态传感器紧凑地表示大型集合和映射。(Represent large sets and maps compactly with finite state transducers.)

  • 所有者: BurntSushi/fst
  • 平台: Linux, Windows
  • 许可证: The Unlicense
  • 分类:
  • 主题:
  • 喜欢:
    0
      比较:

Github星跟踪图

fst

This crate provides a fast implementation of ordered sets and maps using finite
state machines. In particular, it makes use of finite state transducers to map
keys to values as the machine is executed. Using finite state machines as data
structures enables us to store keys in a compact format that is also easily
searchable. For example, this crate leverages memory maps to make range queries
very fast.

Check out my blog post
Index 1,600,000,000 Keys with Automata and
Rust

for extensive background, examples and experiments.

Linux build status
Windows build status

Dual-licensed under MIT or the UNLICENSE.

Documentation

Full API documentation and examples.

The
fst-regex
and
fst-levenshtein
crates provide regular expression matching and fuzzy searching on FSTs,
respectively.

Installation

Simply add a corresponding entry to your Cargo.toml dependency list:

[dependencies]
fst = "0.3"

And add this to your crate root:

extern crate fst;

Example

This example demonstrates building a set in memory and executing a fuzzy query
against it. You'll need fst = "0.3" and fst-levenshtein = "0.2" in your
Cargo.toml.

extern crate fst;
extern crate fst_levenshtein;

use std::error::Error;
use std::process;

use fst::{IntoStreamer, Set};
use fst_levenshtein::Levenshtein;

fn try_main() -> Result<(), Box<Error>> {
  // A convenient way to create sets in memory.
  let keys = vec!["fa", "fo", "fob", "focus", "foo", "food", "foul"];
  let set = Set::from_iter(keys)?;

  // Build our fuzzy query.
  let lev = Levenshtein::new("foo", 1)?;

  // Apply our fuzzy query to the set we built.
  let stream = set.search(lev).into_stream();

  let keys = stream.into_strs()?;
  assert_eq!(keys, vec!["fo", "fob", "foo", "food"]);
  Ok(())
}

fn main() {
  if let Err(err) = try_main() {
    eprintln!("{}", err);
    process::exit(1);
  }
}

Check out the documentation for a lot more examples!

主要指标

概览
名称与所有者BurntSushi/fst
主编程语言Rust
编程语言Rust (语言数: 2)
平台Linux, Windows
许可证The Unlicense
所有者活动
创建于2015-09-05 00:25:46
推送于2024-09-25 20:46:04
最后一次提交2024-09-25 16:46:03
发布数65
最新版本名称fst-bin-0.4.3 (发布于 2023-03-12 19:28:31)
第一版名称0.1.1 (发布于 2015-10-18 11:03:01)
用户参与
星数1.9k
关注者数29
派生数130
提交数271
已启用问题?
问题数88
打开的问题数26
拉请求数44
打开的拉请求数13
关闭的拉请求数27
项目设置
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?