FeatureBase Glossary of terms
0-9
Term | Context | Further information |
---|---|---|
1,048,576 | FeatureBase Community batch-size ingest flag | CSV ingest flags |
A
Term | Context | Further information |
---|---|---|
ALL | PQL query | PQL ALL read query |
Anti-entropy | FeatureBase Community cluster | A periodic process that compares each shard and its replicas across the cluster to repair inconsistencies. |
APPLY | PQL query | PQL APPLY read query |
ARROW | PQL query | PQL ARROW read query |
Authentication | FeatureBase Cloud | Manage cloud users |
Authentication | FeatureBase Community | Community authentication |
B
Term | Context | Further information |
---|---|---|
Batch (ingest) | Data import to FeatureBase | BULK INSERT statement |
Bitmap Bitmap Index (BMI) Roaring B-Tree format (RBT) | FeatureBase database table rows | FeatureBase uses the Roaring Bitmap format to store data. |
Bit Sliced Indexing (BSI) | Multi-bit integer and timestamp data types used for Range, Min, Max and Sum queries | * INT data type * TIMESTAMP data type * MIN query * MAX query * SUM query |
C
Term | Context | Further information |
---|---|---|
CLEAR | PQL query | PQL CLEAR write query |
CLEARROW | PQL query | PQL CLEARROW write query |
Cluster | FeatureBase Community | A Cluster configuration of FeatureBase nodes where data is evenly distributed and any node can respond to queries. Also defines how data is replicated and inter-node communication. |
CONSTROW | PQL query | PQL CONSTROW read query |
COUNT | PQL query | PQL COUNT read query |
D
Term | Context | Further information |
---|---|---|
Database | FeatureBase database | Dedicated resources which contain tables and data. Manage Cloud databases |
Data source | Source of data imported to FeatureBase | FeatureBase imports data from external data sources via HTTPS, Kafka, SQL or CSV ingest processing |
Data types | Table columns | Data types and constraints |
DELETE | PQL query | PQL DELETE write query |
DIFFERENCE | PQL query | PQL DIFFERENCE read query |
DISTINCT | PQL query | PQL DISTINCT read query |
E
Term | Context | Further information |
---|---|---|
EXTRACT | PQL query | PQL EXTRACT read query |
F
Term | Context | Further information |
---|---|---|
Field | Table rows | Field data types to group rows into different categories: * bool * int * set * time * timestamp * Mutex |
Fields, ranked | Table rows | Rows kept in sorted order within the field. |
Fragment | FeatureBase Community Row fields and database shards | A fragment typically corresponds to a file on disk which represents an intersection of: * field and shard, or * field, shard and time data type time quantum constraint |
G
Term | Context | Further information |
---|---|---|
Group By | PQL Query | PQL Group By Query |
H
Term | Context | Further information |
---|---|---|
I
Term | Context | Further information |
---|---|---|
INCLUDESCOLUMN | PQL query | PQL INCLUDESCOLUMN read query |
INTERSECT | PQL query | PQL INTERSECT read query |
Index | FeatureBase tables | Denormalized top-level container roughly the same as an RDBMS table. |
Ingest | Importing data to FeatureBase | Manage community ingest |
J
Term | Context | Further information |
---|---|---|
K
Term | Context | Further information |
---|---|---|
L
Term | Context | Further information |
---|---|---|
LIMIT | PQL query | PQL LIMIT read query |
M
Term | Context | Further information | ||
---|---|---|---|---|
MAX | PQL Read query | PQL MAX Read query | ||
MAX | SQL int constraint | INT data type | ||
MAXSHARDcluster.maxshard parameter | FeatureBase Community Cluster flag | Zero-indexed shard allocation for current records, where maxshard = 0 indicates 1 shard is allocated. | ||
MIN | PQL Read query | PQL MIN Read query | ||
Min | SQL int constraint | INT data type | ||
molecula-consumer-[csv | sql | kafka] | Community ingest | Manage community ingest |
Mutex | String Data type | A FeatureBase field type similar to the Set type, in which only a single value can be set at any time. Conceptually similar to an enum type, but implemented on top of Set fields, with a performance cost from the single-value constraint. Not to be confused with the mutex synchronization primitive. |
N
Term | Context | Further information |
---|---|---|
Node | FeatureBase Community Cluster | An individual running instance of FeatureBase server which belongs to a cluster. |
NOT | PQL query | PQL NOT read query |
O
Term | Context | Further information |
---|---|---|
Options | PQL Options query | PQL OPTIONS |
Organization | FeatureBase Cloud | FeatureBase Organization |
P
Term | Context | Further information |
---|---|---|
PERCENTILE | PQL query | PQL PERCENTILE read query |
Pilosa | Former name of FeatureBase | Pilosa + Molecula = FeatureBase blog post |
Pilosa Query Language (PQL) | Database queries | PQL-Guide |
Protobuf | Binary serialization format used for internal messages which can be used by clients as an alternative to JSON. Protobuf |
Q
Term | Context | Further information |
---|---|---|
Query (PQL) | Pilosa Query Language | PQL Guide |
Query (SQL) | Structured Query Language | SQL Guide |
R
Term | Context | Further information |
---|---|---|
Record Row | Database table row | Equivalent to RDBMS table row. FeatureBase uses “Record” to avoid confusion |
Replicacluster.replicas parameter | FeatureBase Community Clusters | Replica of shard within a cluster. cluster.replicas configuration parameter determines the number of shard replicas within a cluster where replicas=1 indicates no copies have been made. |
Roaring Bitmap | FeatureBase database | roaringbitmap.org |
Row | Rows are the fundamental vertical data axis within FeatureBase. Rows are namespaced by field so the same row ID in a different field refers to a different row. | |
Row _id | ||
Row (Ranged) | PQL query | PQL Row read query |
Row (Timestamp) | PQL query | PQL Row read query |
Rows | PQL query | PQL Rows read query |
S
Term | Context | Further information |
---|---|---|
SET | PQL query | PQL SET write query |
Shard | FeatureBase Community Cluster | Records are sharded on a preset width. Shards are operated on in parallel and are evenly distributed across the cluster via a consistent hash. |
ShardWidthcluster.shardwidth parameter | FeatureBase Community Cluster | This is the number of records in a shard. ShardWidth defaults to 2^20 or about one million. It can be modified, but only at compile time, and before ingesting any data. |
SORT | PQL query | PQL SORT read query |
STORE | PQL query | PQL STORE write query |
SUM | PQL query | PQL SUM read query |
T
Term | Context | Further information |
---|---|---|
Time Quantum | SQL IDSET and STRINGSET constraint | IDSET data type STRINGSET data type |
Timestamp | Data type | Timestamp data type |
TTL (Time To Live) | IDSET and STRINGSET constraint | IDSET data type STRINGSET data type |
TOML | FeatureBase Community configuration files | Tom’s Obvious Minimal Language (TOML) |
TopK | PQL query | PQL TOPK read query |
TopN | PQL query | PQL TOPN read query |
U
Term | Context | Further information |
---|---|---|
UNION | PQL query | PQL UNION read query |
UNIONROWS | PQL query | PQL UNIONROWS read query |
V
Term | Context | Further information |
---|---|---|
View | FeatureBase fields | Internally managed method to separate data layouts within a field. Not exposed by the API |
View (Primary) | FeatureBase fields | Standard view that represents typical base data |
View (Time-based) | FeatureBase fields | Automatically generated view for time quantum fields |
W - X - Y - Z
Term | Context | Further information |
---|---|---|
XOR | PQL query | PQL XOR read query |