Databend is a modern cloud data warehouse, serving your massive-scale analytics needs at low cost and complexity. Open source alternative to Snowflake. Also available in the cloud: https://app.databend.com .
What's New
Stay informed about the latest features of Databend.
Querying Data on HuggingFace File System with Databend
Hugging Face is currently the most popular AI community. Databend now supports direct queries and analysis of massive data and models stored on the Hugging Face file system.
URI format: hf://{repo_id}/path/to/file
, where a repo_id
might look like fka/awesome-chatgpt-prompts
.
Supported configurations include:
repo_type
: The type of HuggingFace repository, default isdataset
, available options aredataset
,model
.revision
: The revision version of HuggingFace, default ismain
. Can be a branch, tag, or commit in the repository.token
: The API token of HuggingFace.
The following example queries fka/awesome-chatgpt-prompts
and lists the first 5 rows of the first column in a CSV file.
If you would like to learn more, please contact the Databend team or refer to the resources listed below:
Code Corner
Discover some fascinating code snippets or projects that showcase our work or learning journey.
Data Type Mappings across Databend, MySQL, and Oracle
This table provides an outline of the mapping of data types between Databend, MySQL, and Oracle.
Databend | MySQL | Oracle |
---|---|---|
TINYINT | TINYINT | NUMBER(3,0) |
SMALLINT | SMALLINT | NUMBER(5,0) |
INT | INT | NUMBER(10,0) |
BIGINT | BIGINT | NUMBER(19,0) |
FLOAT | FLOAT | FLOAT |
DOUBLE | DOUBLE | FLOAT(24) |
DECIMAL | DECIMAL | FLOAT(24) |
DATE | DATE | DATE |
TIMESTAMP | TIMESTAMP | NUMBER |
DATETIME | DATETIME | DATE |
YEAR | INT | NUMBER |
VARCHAR | VARCHAR | VARCHAR2 |
VARCHAR | CHAR | CHAR |
VARBINARY | VARBINARY | RAW, BLOB |
VARCHAR | VARCHAR | VARCHAR2 |
VARCHAR | VARCHAR | RAW, CBLOB |
VARBINARY | VARBINARY | RAW, BLOB |
VARCHAR | VARCHAR | RAW, CBLOB |
VARCHAR | VARCHAR | VARCHAR2 |
VARCHAR | VARCHAR | VARCHAR2 |
ARRAY | N/A | N/A |
BOOLEAN | N/A | N/A |
TUPLE | N/A | N/A |
MAP | N/A | N/A |
JSON, VARIANT | JSON | JSON |
BITMAP | N/A | N/A |
Highlights
We have also made these improvements to Databend that we hope you will find helpful:
- Added the
Binary
data type and support for conversion between String and Binary. - Support for adaptive filter reorder.
- Support for JSON function
concat
. - Support for automatic refresh of the
ReadOnlyAttach
table schema. - Support for a greedy JOIN order algorithm.
What's Up Next
We're always open to cutting-edge technologies and innovative ideas. You're more than welcome to join the community and bring them to Databend.
Adding Support for Task Advice System Table
Databend plans to introduce the task_advice
system table to provide effective insights on daily operations and help database administrators to manage their data more easily.
catalog_name | database_name | table_name | task_type | need_run | task_sql | reason(variant) |
---|---|---|---|---|---|---|
default | db | xx | COMPACT | 1 | optimize table xx compact limit 3 | "{status in json}" |
default | db | xx | AGGREGATING_INDEX | 0 | refresh aggregating index xx_agg_idx limit 10 | "{status in json}" |
default | db | yy | ADD_CLUSTER_KEY | 1 | alter table yy cluster by(col1) | "{status in json}" |
Issue #14323 | feat: task_advice system table
Please let us know if you're interested in contributing to this feature, or pick up a good first issue at https://link.databend.rs/i-m-feeling-lucky to get started.
Changelog
You can check the changelog of Databend Nightly for details about our latest developments.
Full Changelog: https://github.com/datafuselabs/databend/compare/v1.2.286-nightly...v1.2.296-nightly
ContributorsA total of 19 contributors participated
A total of 19 contributors participated
We are very grateful for the outstanding work of the contributors.