FeatureBase Glossary of terms
0-9
Term | Context | Further information |
---|---|---|
1,048,576 | FeatureBase Community batch-size ingest flag | CSV ingest flags |
A
Term | Context | Further information |
---|---|---|
ALL | PQL query | PQL ALL read query |
Anti-entropy | FeatureBase Community cluster | Anti-entropy is a periodic process that compares each shard and its replicas across a cluster to repair inconsistencies. |
APPLY | PQL query | PQL APPLY read query |
ARROW | PQL query | PQL ARROW read query |
Authentication | FeatureBase Cloud | Manage cloud users |
Authentication | FeatureBase Community | Community authentication |
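
The anti-entropy entry above describes comparing each shard with its replicas and repairing inconsistencies. A minimal sketch of that idea follows, assuming each replica can be modelled as a set of bit positions. The function names and the majority-vote repair policy are illustrative assumptions, not FeatureBase's documented implementation.

```python
import hashlib
from collections import Counter
from typing import List, Set


def checksum(bits: Set[int]) -> str:
    """Hash a replica's contents so replicas can be compared cheaply."""
    payload = ",".join(str(b) for b in sorted(bits))
    return hashlib.sha256(payload.encode()).hexdigest()


def repair(replicas: List[Set[int]]) -> bool:
    """Bring all replicas of one shard into agreement.

    Returns True if at least one replica had to be changed.
    Assumed policy: a bit survives if a strict majority of replicas hold it.
    """
    if len({checksum(r) for r in replicas}) <= 1:
        return False  # replicas already agree (or there are none)
    votes = Counter(bit for r in replicas for bit in r)
    majority = {bit for bit, n in votes.items() if n * 2 > len(replicas)}
    for r in replicas:
        r.clear()
        r.update(majority)
    return True


replicas = [{1, 2, 3}, {1, 2}, {1, 2, 3}]  # second replica missed a write
changed = repair(replicas)
```

Comparing checksums first means the more expensive per-bit vote only runs when replicas actually differ.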
B
Term | Context | Further information |
---|---|---|
Batch (ingest) | Data import to FeatureBase | BULK INSERT statement |
Bitmap, Bitmap Index (BMI), Roaring B-Tree format (RBT) | FeatureBase database table rows | FeatureBase uses the Roaring Bitmap format to store data. |
Bit Sliced Indexing (BSI) | Multi-bit integer and timestamp data types used for Range, Min, Max and Sum queries | * INT data type * TIMESTAMP data type * MIN query * MAX query * SUM query |
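
To illustrate why bit-sliced indexing suits Sum queries, here is a minimal sketch of the general technique, not FeatureBase's actual storage code. It assumes unsigned integers and uses plain Python integers as stand-ins for bitmaps. The names `build_bsi` and `bsi_sum` are hypothetical.

```python
from typing import Dict, List


def build_bsi(values: Dict[int, int], bit_depth: int) -> List[int]:
    """Return one bitmap per bit position.

    Bit `record_id` of slices[i] is set when bit i of that record's value is 1.
    """
    slices = [0] * bit_depth
    for record_id, value in values.items():
        if value < 0 or value >= (1 << bit_depth):
            raise ValueError(f"value {value} does not fit in {bit_depth} bits")
        for i in range(bit_depth):
            if (value >> i) & 1:
                slices[i] |= 1 << record_id
    return slices


def bsi_sum(slices: List[int], record_filter: int) -> int:
    """Sum the values of the records selected by `record_filter`.

    Each slice contributes (number of selected records with that bit set) * 2**i.
    """
    return sum(bin(s & record_filter).count("1") << i for i, s in enumerate(slices))


slices = build_bsi({0: 5, 1: 3, 2: 10}, bit_depth=4)
total_all = bsi_sum(slices, 0b111)   # records 0, 1 and 2
total_some = bsi_sum(slices, 0b101)  # records 0 and 2 only
```

The sum needs one intersection and one population count per bit position, so its cost grows with the bit depth rather than with the number of individual values read.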
C
Term | Context | Further information |
---|---|---|
CLEAR | PQL query | PQL CLEAR write query |
CLEARROW | PQL query | PQL CLEARROW write query |
Cluster | FeatureBase Community | A group of FeatureBase nodes across which data is evenly distributed, so that any node can respond to queries. The cluster configuration also defines how data is replicated and how nodes communicate with each other. |
Concurrency | SQL/PQL Queries | Number of concurrent users running queries on data and how this may affect query latency |
CONSTROW | PQL query | PQL CONSTROW read query |
COUNT | PQL query | PQL COUNT read query |
D
Term | Context | Further information |
---|---|---|
Database | FeatureBase database | Dedicated resources which contain tables and data. Manage Cloud databases |
Data source | Source of data imported to FeatureBase | FeatureBase imports data from external data sources via HTTPS, Kafka, SQL or CSV ingest processing |
Data types | Table columns | Data types and constraints |
DELETE | PQL query | PQL DELETE write query |
DIFFERENCE | PQL query | PQL DIFFERENCE read query |
DISTINCT | PQL query | PQL DISTINCT read query |
E
Term | Context | Further information |
---|---|---|
EXTRACT | PQL query | PQL EXTRACT read query |
F
Term | Context | Further information |
---|---|---|
Field | Table rows | Field data types group rows into different categories: * bool * int * set * time * timestamp * mutex |
Fields, ranked | Table rows | Rows kept in sorted order within the field. |
Fragment | FeatureBase Community row fields and database shards | A fragment typically corresponds to a file on disk that represents the intersection of: * a field and a shard, or * a field, a shard and the time quantum constraint of a time data type |
Freshness | Data import/ingest | How much time elapses from when a data point is “sensed” by the system until that data point affects the results of a query. |
G
Term | Context | Further information |
---|---|---|
Group By | PQL Query | PQL Group By Query |
H
Term | Context | Further information |
---|---|---|
I
Term | Context | Further information |
---|---|---|
INCLUDESCOLUMN | PQL query | PQL INCLUDESCOLUMN read query |
INTERSECT | PQL query | PQL INTERSECT read query |
Index | FeatureBase tables | Denormalized top-level container roughly the same as an RDBMS table. |
Ingest | Importing data to FeatureBase | Manage community ingest |
J
Term | Context | Further information |
---|---|---|
K
Term | Context | Further information |
---|---|---|
L
Term | Context | Further information |
---|---|---|
Latency | SQL/PQL Queries | How much time elapses between when a query is sent to a system and when the results return to the client. |
M
Term | Context | Further information |
---|---|---|
MAX | PQL Read query | PQL MAX Read query |
MAX | SQL int constraint | INT data type |
MAXSHARD (cluster.maxshard parameter) | FeatureBase Community Cluster flag | Zero-indexed shard allocation for current records, where maxshard = 0 indicates 1 shard is allocated. |
MIN | PQL Read query | PQL MIN Read query |
Min | SQL int constraint | INT data type |
./molecula-consumer-[csv | sql | kafka] | Community ingest | Manage community ingest |
Mutex | String Data type | A FeatureBase field type similar to the Set type, in which only a single value can be set at any time. Conceptually similar to an enum type, but implemented on top of Set fields, with a performance cost from the single-value constraint. Not to be confused with the mutex synchronization primitive. |
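
The difference between the Set and Mutex field types described above can be sketched as follows. This is a simplified model under stated assumptions: `SetField` and `MutexField` are hypothetical classes written for illustration, not FeatureBase APIs.

```python
from collections import defaultdict


class SetField:
    """A record may hold any number of values at once."""

    def __init__(self):
        self.rows = defaultdict(set)  # value -> set of record IDs

    def set(self, value, record_id):
        self.rows[value].add(record_id)

    def values_for(self, record_id):
        return sorted(v for v, ids in self.rows.items() if record_id in ids)


class MutexField(SetField):
    """Same storage as SetField, but a record holds at most one value."""

    def set(self, value, record_id):
        # Extra work compared with a plain set: clear the record from every
        # other row first. This models the cost of the single-value constraint.
        for other, ids in self.rows.items():
            if other != value:
                ids.discard(record_id)
        super().set(value, record_id)


tags = SetField()
tags.set("red", 1)
tags.set("blue", 1)

status = MutexField()
status.set("red", 1)
status.set("blue", 1)  # replaces "red" for record 1
```

After these calls, record 1 holds both values in `tags` but only the most recent value in `status`.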
N
Term | Context | Further information |
---|---|---|
Node | FeatureBase Community Cluster | An individual running instance of FeatureBase server which belongs to a cluster. |
Normalizing | The act of identifying the cardinality of your data in order to design the relationships between different tables. | FeatureBase concepts |
NOT | PQL query | PQL NOT read query |
O
Term | Context | Further information |
---|---|---|
Options | PQL Options query | PQL OPTIONS |
Organization | FeatureBase Cloud | FeatureBase Organization |
P
Term | Context | Further information |
---|---|---|
PERCENTILE | PQL query | PQL PERCENTILE read query |
Pilosa | Former name of FeatureBase | Pilosa + Molecula = FeatureBase blog post |
Pilosa Query Language (PQL) | Database queries | PQL-Guide |
Protobuf | Binary serialization format used for internal messages, which clients can also use as an alternative to JSON. | Protobuf |
Q
Term | Context | Further information |
---|---|---|
Query (PQL) | Pilosa Query Language | PQL Guide |
Query (SQL) | Structured Query Language | SQL Guide |
R
Term | Context | Further information |
---|---|---|
Record, Row | Database table row | Equivalent to an RDBMS table row. FeatureBase uses “Record” to avoid confusion. |
Replica (cluster.replicas parameter) | FeatureBase Community Clusters | A copy of a shard within a cluster. The cluster.replicas configuration parameter determines the number of shard replicas within a cluster, where replicas=1 indicates that each shard is stored once and no additional copies have been made. |
Roaring Bitmap | FeatureBase database | roaringbitmap.org |
Row | Rows are the fundamental vertical data axis within FeatureBase. Rows are namespaced by field so the same row ID in a different field refers to a different row. | |
Row _id | | |
Row (Ranged) | PQL query | PQL Row read query |
Row (Timestamp) | PQL query | PQL Row read query |
Rows | PQL query | PQL Rows read query |
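
The Row entry above notes that rows are namespaced by field. A minimal sketch of what that means, assuming a dictionary keyed by a (field, row ID) pair. The `set_bit` helper and the field names are hypothetical.

```python
from typing import Dict, Set, Tuple

# (field name, row ID) -> set of record IDs that have this row set
rows: Dict[Tuple[str, int], Set[int]] = {}


def set_bit(field: str, row_id: int, record_id: int) -> None:
    """Mark `record_id` as belonging to row `row_id` of `field`."""
    rows.setdefault((field, row_id), set()).add(record_id)


set_bit("color", 1, 10)  # row 1 of the "color" field
set_bit("size", 1, 20)   # row 1 of the "size" field is a different row
```

Because the field is part of the key, the two rows with ID 1 never share data.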
S
Term | Context | Further information |
---|---|---|
SET | PQL query | PQL SET write query |
Shard | Roaring Bitmap format | Roaring Bitmap Format |
Shard | FeatureBase Community Cluster | FeatureBase Community cluster |
ShardWidth (cluster.shardwidth parameter) | FeatureBase Community Cluster | FeatureBase Community cluster |
SORT | PQL query | PQL SORT read query |
STORE | PQL query | PQL STORE write query |
SUM | PQL query | PQL SUM read query |
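
The relationship between record IDs, ShardWidth and the zero-indexed MAXSHARD value can be sketched with a little arithmetic. This assumes each shard covers a contiguous range of ShardWidth record IDs. The width of 2**20 used here is an assumed value for illustration, and both function names are hypothetical.

```python
ASSUMED_SHARD_WIDTH = 2 ** 20  # illustrative value, check your cluster configuration


def shard_for(record_id: int, shard_width: int = ASSUMED_SHARD_WIDTH) -> int:
    """Return the zero-indexed shard that holds `record_id`."""
    if record_id < 0:
        raise ValueError("record_id must be non-negative")
    return record_id // shard_width


def shards_allocated(maxshard: int) -> int:
    """MAXSHARD is zero-indexed, so maxshard = 0 means one shard is allocated."""
    return maxshard + 1


first = shard_for(0)
last_in_first = shard_for(ASSUMED_SHARD_WIDTH - 1)
first_in_second = shard_for(ASSUMED_SHARD_WIDTH)
```

Under these assumptions, the highest record ID in use determines MAXSHARD, and the number of shards is always one more than that value.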
T
Term | Context | Further information |
---|---|---|
Throughput | Data import/ingestion | Quantity of data that can be imported/ingested in a given time. May involve a trade-off between Latency and Freshness. |
Time Quantum | SQL IDSET and STRINGSET constraint | IDSET data type STRINGSET data type |
Timestamp | Data type | Timestamp data type |
TTL (Time To Live) | IDSET and STRINGSET constraint | IDSET data type STRINGSET data type |
TOML | FeatureBase Community configuration files | Tom’s Obvious Minimal Language (TOML) |
TopK | PQL query | PQL TOPK read query |
TopN | PQL query | PQL TOPN read query |
U
Term | Context | Further information |
---|---|---|
UNION | PQL query | PQL UNION read query |
UNIONROWS | PQL query | PQL UNIONROWS read query |
V
Term | Context | Further information |
---|---|---|
View | FeatureBase fields | Internally managed method to separate data layouts within a field. Not exposed by the API |
View (Primary) | FeatureBase fields | Standard view that represents typical base data |
View (Time-based) | FeatureBase fields | Automatically generated view for time quantum fields |
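
To show how time-based views could be generated automatically for a field with a time quantum, here is a hedged sketch. The quantum letters (Y, M, D, H), the `standard` base name and the view naming scheme are assumptions made for illustration. Views are internal and not exposed by the API, so the real layout may differ.

```python
from datetime import datetime
from typing import List

# Assumed mapping from quantum unit to the timestamp precision it keeps
_FORMATS = {"Y": "%Y", "M": "%Y%m", "D": "%Y%m%d", "H": "%Y%m%d%H"}


def views_for(timestamp: datetime, quantum: str, base: str = "standard") -> List[str]:
    """Return one hypothetical view name per unit in the time quantum."""
    unknown = set(quantum) - set(_FORMATS)
    if unknown:
        raise ValueError(f"unsupported quantum units: {sorted(unknown)}")
    return [f"{base}_{timestamp.strftime(_FORMATS[unit])}" for unit in quantum]


views = views_for(datetime(2023, 5, 17, 9), "YMD")
```

Writing the same data into one view per granularity lets a time-range query read a few coarse views instead of many fine-grained ones, at the cost of extra storage.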
W - X - Y - Z
Term | Context | Further information |
---|---|---|
XOR | PQL query | PQL XOR read query |