Commit Graph

7241 Commits

Author SHA1 Message Date
lyw 2cc7a0547d
[Enhancement] fix null_or_empty can support array type (#62207)
Signed-off-by: liyanwei14 <liyanwei1@jd.com>
Signed-off-by: stdpain <34912776+stdpain@users.noreply.github.com>
Co-authored-by: stdpain <34912776+stdpain@users.noreply.github.com>
2025-08-29 11:57:02 +08:00
eyes_on_me 6c220d0dac
[Enhancement] support expr reuse in outer join where predicates (#62139)
Signed-off-by: silverbullet233 <3675229+silverbullet233@users.noreply.github.com>
2025-08-29 11:19:32 +08:00
shuming.li b7380d396c
[Feature] (IVM Part4) Support more agg combinator functions (#62122)
Signed-off-by: shuming.li <ming.moriarty@gmail.com>
2025-08-29 11:17:51 +08:00
zihe.liu 08ec7af660
[BugFix] Fix compile memcpy_inlined_overflow16 on ARM (#62478)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-08-29 10:01:23 +08:00
zihe.liu 6034903e92
[BugFix] Fix division by zero for partition hash join (#62474)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-08-29 09:40:04 +08:00
starrocks-xupeng ad5ae303b4
[Enhancement] reduce unnecessary storage related logs (#62121)
Signed-off-by: starrocks-xupeng <xupeng@starrocks.com>
2025-08-28 17:28:00 +08:00
zihe.liu 67f6144903
[BugFix] Fix UAF for BinaryColumn::append_selective (#62410)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-08-28 15:24:51 +08:00
stdpain dce102af1c
[BugFix] Fix integer overflow caused by integer left shift in compression key (#62366)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-28 14:47:05 +08:00
eyes_on_me 5b5c24e6fd
[BugFix] fix lambda common expr slot id conflicts in array_map (#62414)
Signed-off-by: silverbullet233 <3675229+silverbullet233@users.noreply.github.com>
2025-08-28 05:50:04 +00:00
stdpain 8018ed1de5
[UT] Fix ut append_chunk_safe (#62413)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-28 11:31:06 +08:00
Murphy a823459549
[BugFix] fix the overlap check of zonemap (#62369) 2025-08-28 10:19:44 +08:00
stdpain 15cf9745e3
[BugFix] Fix UAF when FixedLengthColumn append self (#62375)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-27 19:58:02 +08:00
Yixin Luo 4fdbf8f379
[BugFix] fix combine txnlog vacuum issue when delete tablets (#62363)
Signed-off-by: luohaha <18810541851@163.com>
2025-08-27 19:02:39 +08:00
Murphy 61474e35c8
[Enhancement] create adaptive zonemap index for strings (#61965)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-08-27 11:03:35 +08:00
zihe.liu 09a7019fb7
[Enhancement] Adjust partition hash join strategy (#61405)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-08-27 10:46:43 +08:00
Murphy 1a10dd2007
[Enhancement] enable flat json by default (#62097) 2025-08-27 10:14:55 +08:00
Simon Griebel f67ca1b875
[Enhancement] Improve the runtime of materialize_by_permutation (#62102)
Signed-off-by: Simon Griebel <s.griebel@celonis.com>
2025-08-27 09:50:20 +08:00
zihe.liu 6ba3236b1c
[UT] Fix join test timeout (#62298)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-08-26 13:50:31 +08:00
Murphy bfcc5d3204
[Enhancement] create string column zonemap with prefix truncation (#61975)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-08-26 13:22:21 +08:00
Yixin Luo 33ea13c282
[BugFix] avoid BE crash when LakePersistentIndex init fail (#62279)
Signed-off-by: luohaha <18810541851@163.com>
2025-08-25 12:12:54 +00:00
zhanghe 44456376f3
[BugFix]Fix the error of using GTID to handle dirty tablet metadata (#62275)
Signed-off-by: edwinhzhang <edwinhzhang@tencent.com>
2025-08-25 09:39:30 +00:00
zihe.liu 3b5872dfad
[Enhancement] Use linear-chained to optimize hash join (#61429)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-08-25 16:40:11 +08:00
xiangguangyxg 4389f39325
[BugFix] Add lock when get max version from tablet in replication txn manager (#62238)
Signed-off-by: xiangguangyxg <xiangguangyxg@gmail.com>
2025-08-25 15:59:30 +08:00
zihe.liu 5925d146b7
[Enhancement] Optimize append_selective for binary column (#62165)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-08-25 11:31:56 +08:00
yan zhang 8bbb2a3305
[Enhancement] short circuit optimization on select limit case (on Scan Node) (#62188)
Signed-off-by: yan zhang <dirtysalt1987@gmail.com>
2025-08-25 11:01:44 +08:00
Seaven d6b3d16bf4
[BugFix] Fix push down string expr error (#62224)
Signed-off-by: Seaven <seaven_7@qq.com>
2025-08-25 10:48:54 +08:00
starrocks-xupeng 5a7aa8cc17
[BugFix] fix cn crash when cache is turned off (#62174) 2025-08-25 10:17:51 +08:00
Yixin Luo 288aa732b7
[BugFix] support lazy delta column compact for size tiered compaction in pk table to reduce cost (#61930)
Signed-off-by: luohaha <18810541851@163.com>
2025-08-22 16:46:44 +00:00
Gavin 6172f84e4f
[Enhancement] Optimize the iceberg sink local sorting based on the spill partition writer (#62096)
Signed-off-by: GavinMar <yangguansuo@starrocks.com>
2025-08-23 00:20:56 +08:00
Murphy 87713094cc
[BugFix] check if it's flatjson before non-existent field optimization (#62227) 2025-08-23 00:20:29 +08:00
trueeyu b3305cceeb
[Refactor] Move all scanners to a single directory. (#62204)
Signed-off-by: trueeyu <lxhhust350@qq.com>
2025-08-22 15:17:02 +08:00
zombee0 96a37ca97b
[BugFix]runtime filter partition are bucket aware (#62191)
Signed-off-by: zombee0 <ewang2027@gmail.com>
2025-08-22 11:21:05 +08:00
stdpain b5ec74838a
[Enhancement] make profile counter as compile option (#62103)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-22 10:02:49 +08:00
Yixin Luo c695a4d2b7
[Enhancement] update persistent index size statistic when do major compaction (#62195)
Signed-off-by: luohaha <18810541851@163.com>
2025-08-21 12:53:46 +00:00
stdpain 67ab5e56ad
[BugFix] Fix partitioned hash join crash when enable query cache (#62146)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-21 15:25:59 +08:00
stdpain 5a78cb3c75
[BugFix] Fix throw exception issue in low-cardinality optimization error in ALLOW_THROW_EXCEPTION mode (#62098)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-20 16:51:29 +08:00
Murphy aa4adc5a3d
[Enhancement] Optimize accessing non-existent JSON field (#62003) 2025-08-20 14:46:07 +08:00
Murphy 29c8d76cd0
[BugFix] fix json global dict with heterogeneous schema (#62001) 2025-08-20 10:48:17 +08:00
zihe.liu a7f9942582
[BugFix] Fix set_tablet_schema for partition_morsel_queue(split_morsel_queue) (#62034)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-08-20 10:01:39 +08:00
shuming.li 087fd0a622
[Refactor] (IVM Part3) Refactor AggStateCombinator (#61951)
Signed-off-by: shuming.li <ming.moriarty@gmail.com>
2025-08-20 10:01:17 +08:00
wyb c1e1b1fdb4
[Enhancement] Improve fe tablet schedules system table (#62073)
Signed-off-by: wyb <wybb86@gmail.com>
2025-08-19 23:22:38 +08:00
wyb c301a76c4b
[BugFix] Fix missing clone copy size and duration (#62074)
Signed-off-by: wyb <wybb86@gmail.com>
2025-08-19 20:22:42 +08:00
wangshifa.1 d0c5c0cc33
[Feature] Add date function of weekday (#61865)
Why I'm doing:
Fixes #61695

What I'm doing:
Fixes #61695 by adding a WEEKDAY function. It is similar to the dayofweek_iso function, but it returns a number from 0 to 6 (corresponding to Monday through Sunday).
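
The numbering described above can be illustrated with a small Python sketch. It is not the StarRocks implementation; it assumes that dayofweek_iso follows ISO numbering (Monday = 1 through Sunday = 7), which Python's `date.isoweekday()` also uses, and the helper name `weekday_monday_zero` is hypothetical.

```python
from datetime import date


def weekday_monday_zero(d: date) -> int:
    """Return 0 for Monday through 6 for Sunday (hypothetical helper)."""
    # isoweekday() yields Monday=1 ... Sunday=7, so shifting by one
    # gives the 0..6 range described for WEEKDAY.
    return d.isoweekday() - 1


if __name__ == "__main__":
    print(weekday_monday_zero(date(2025, 8, 18)))  # 2025-08-18 is a Monday
    print(weekday_monday_zero(date(2025, 8, 24)))  # 2025-08-24 is a Sunday
```

Python's built-in `date.weekday()` uses the same 0 (Monday) to 6 (Sunday) convention, so it can serve as a cross-check for the helper.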

Signed-off-by: wangshifa.1 <wangshifa.1@jd.com>
2025-08-19 16:16:45 +08:00
Hongkun Xu 72fe131ace
[Feature] Support MATCH_ALL operator (#61906)
Signed-off-by: Hongkun Xu <xuhongkun666@163.com>
2025-08-19 15:46:49 +08:00
Gavin 14a8c1415a
[Enhancement] Introduce a connector partition chunk writer to support spilling chunk data for iceberg table sink. (#61963)
Signed-off-by: GavinMar <yangguansuo@starrocks.com>
2025-08-19 11:38:35 +08:00
stdpain 9c04977e7a
[Enhancement] avoid pow regression in glibc-2.34 (#62044)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-19 10:22:05 +08:00
stdpain 48a7ac22e0
[BugFix] Fix group by compressed key cause wrong result on decimal (#62022)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-18 19:55:24 +08:00
stdpain c951cc6dfd
[Enhancement] dump distro and arch info in crash log (#62017)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-18 15:46:15 +08:00
Yixin Luo 463b29f8bd
[BugFix] avoid get file size in report tablet stat thread (#61901)
Signed-off-by: luohaha <18810541851@163.com>
2025-08-18 15:35:39 +08:00
zhangqiang 3309b7555e
[Refactor] Change some vlog level (#61995)
Signed-off-by: sevev <qiangzh95@gmail.com>
2025-08-18 12:00:12 +08:00
srlch 441f126793
[Enhancement] VacuumFull Implementation (#61602)
Signed-off-by: srlch <linzichao@starrocks.com>
Co-authored-by: Connor Brennan <cbrennan@pinterest.com>
2025-08-18 10:37:20 +08:00
Murphy 3e64a479b4
[Enhancement] optimize GlobalDictCodeColumnIterator::decode_string_dict_codes (#62002) 2025-08-18 10:35:53 +08:00
yan zhang cb98f70a4e
[BugFix] fix parquet array write when split null string (#61999)
Signed-off-by: yan zhang <dirtysalt1987@gmail.com>
2025-08-18 10:21:58 +08:00
Murphy 413d6b9651
[Enhancement] support encode_sort_key function (#61781)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-08-15 15:26:22 +08:00
stdpain 9000728aa5
[BugFix] Correct add query context to context conditions (#61929)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-14 19:14:42 +08:00
zhangqiang bb99b62b44
[BugFix] Disable sync_publish for shadow tablet (#61887)
Signed-off-by: sevev <qiangzh95@gmail.com>
2025-08-14 17:21:34 +08:00
Gavin 55cb223971
[Refactor] Introduce a load chunk spiller and refactor the load spill memtable sink based on it. (#61866)
Why I'm doing:
StarRocks supports spilling some intermediate data to disk or object storage when writing to a native table. This avoids writing too many small files under memory pressure.

The same issue also exists when writing to an external table. However, the spill procedure is currently tightly coupled with native tables and cannot be reused directly for external tables.

So it is necessary to introduce a separate module that implements the spill function and can easily be used by both native and external tables.

What I'm doing:
Introduce a load chunk spiller to handle the load and merge functions.
Refactor the spill memtable sink of native table based on the load chunk spiller.

Signed-off-by: GavinMar <yangguansuo@starrocks.com>
2025-08-14 15:14:37 +08:00
Yixin Luo 39cf319bf9
[BugFix] avoid hold tablet shard lock to get compaction score (#61899)
Signed-off-by: luohaha <18810541851@163.com>
2025-08-14 10:36:17 +08:00
meegoo 84243343b8
[Feature] Support multi statement transaction (part1) - stream load (#61362)
Signed-off-by: meegoo <meegoo.sr@gmail.com>
2025-08-14 10:17:58 +08:00
zihe.liu 6b0fd1ee94
[BugFix] Fix NPE for JoinHashTable::mem_usage (#61872)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-08-14 10:11:15 +08:00
stdpain 0e12bcc9cf
[BugFix] Fix QueryContext cancel may cause use-after-free (#61897)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-14 09:37:11 +08:00
stdpain 5f6cdde3a0
[Enhancement] support group by compressed key (#61632)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-14 09:32:20 +08:00
xiangguangyxg 4ef246685f
[Enhancement] Separate path id from physical partition id (#61854)
Signed-off-by: xiangguangyxg <xiangguangyxg@gmail.com>
2025-08-13 17:01:36 +08:00
zhanghe 7b95d648bd
[BugFix] Fix the problem with the number of rebuild file counted. (#61859)
Signed-off-by: edwinhzhang <edwinhzhang@tencent.com>
2025-08-13 16:46:22 +08:00
yan zhang 46c2c0f5af
[BugFix] fix min/max optimization on iceberg on partition columns (#61858)
Signed-off-by: yan zhang <dirtysalt1987@gmail.com>
2025-08-13 14:11:16 +08:00
Murphy 561f99eeac
[Enhancement] Implement function json_contains (#61403)
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-08-13 10:42:27 +08:00
zombee0 2a3e4bc8a7
[BugFix]sqlserver doesn't support timeout greater than 65535 (#61719)
Signed-off-by: zombee0 <ewang2027@gmail.com>
2025-08-12 15:46:46 +08:00
zombee0 b9dbceaa2b
[BugFix] set bucket_aware for shuffler (#61801)
Signed-off-by: zombee0 <ewang2027@gmail.com>
2025-08-12 14:13:15 +08:00
wyb 2a611f4a27
[Enhancement] Add tablet info collection time in tablet report (#61643)
Signed-off-by: wyb <wybb86@gmail.com>
2025-08-12 11:53:27 +08:00
srlch 16af4ce6ab
[BugFix] Fix auto increment value lost when partial update in COLUMN_UPSERT_MODE for share nothing (#61341)
Signed-off-by: srlch <linzichao@starrocks.com>
2025-08-12 11:13:00 +08:00
Murphy cc7c240861
[Enhancement] collect global dict for flatjson (#61680)
Signed-off-by: Murphy <mofei@starrocks.com>
2025-08-12 11:00:56 +08:00
trueeyu 1db0060022
[BugFix] Fix the bug in JDBC's processing of the TIME type (#61783)
Signed-off-by: trueeyu <lxhhust350@qq.com>
2025-08-12 10:18:52 +08:00
starrocks-xupeng 26ade294ed
[Enhancement] support write starlet file with tag (#61605)
Signed-off-by: starrocks-xupeng <xupeng@starrocks.com>
Signed-off-by: 絵空事スピリット <wanglichen@starrocks.com>
Co-authored-by: 絵空事スピリット <wanglichen@starrocks.com>
2025-08-11 14:32:43 +00:00
Hongkun Xu f8e7371c48
[Feature] Support MATCH_ANY operator (#60986)
Signed-off-by: Hongkun Xu <xuhongkun666@163.com>
2025-08-11 22:24:13 +08:00
starrocks-xupeng ac8a74a78e
[BugFix] support configure starcache inline cache count limit (#61724)
Signed-off-by: starrocks-xupeng <xupeng@starrocks.com>
2025-08-11 13:49:05 +08:00
Murphy bf792a6455
[BugFix] fix builtin_function fuzzy test (#61530)
What I'm doing:
bits_function: the implementation is wrong
change the static DCHECK to dynamic argument validation for some functions
fix some type mapping errors in logical_type.cpp

Signed-off-by: Murphy <mofei@starrocks.com>
2025-08-11 13:39:06 +08:00
eyes_on_me 8b3e28ad43
[BugFix] fix mem alloc issue of AggHashSetOfSerializedKey (#61558)
Signed-off-by: silverbullet233 <3675229+silverbullet233@users.noreply.github.com>
2025-08-11 10:54:42 +08:00
Murphy 51426082f2
[BugFix] fix the heap-use-after-free issue of json_remove (#61714) 2025-08-10 15:00:43 +08:00
zhanghe c2e3ae1d5c
[BugFix]Fix java.time.LocalDate type check. (#61684)
Signed-off-by: edwinhzhang <edwinhzhang@tencent.com>
2025-08-08 15:17:54 +08:00
wyb d6088ff298
[Enhancement] Bump librdkafka to 2.11.0 for kafka 4.0 (#61698)
Signed-off-by: wyb <wybb86@gmail.com>
2025-08-08 06:31:54 +00:00
Murphy e270cf409b
[Enhancement] FlatJSON V2 for lake table (#61663)
Signed-off-by: Murphy <mofei@starrocks.com>
2025-08-08 10:21:05 +08:00
PengFei Li e8982e7797
[Enhancement] Add prepared_timeout configuration for transaction stream load (#61539)
## Why I'm doing:

Currently, users can only configure the timeout for prepared transactions through the global FE configuration `prepared_transaction_default_timeout_second`. This approach lacks flexibility as it requires all transactions to use the same timeout value. Users need the ability to specify different timeout values for different transactions based on their specific requirements, especially in production environments where precise control over transaction lifecycle is crucial.

## What I'm doing:

This PR adds support for the `prepared_timeout` configuration in transaction stream load, allowing users to specify a timeout period for transactions from PREPARED to COMMITTED state. The implementation includes:

**Backend Changes:**
- Added `HTTP_PREPARED_TIMEOUT` constant in `be/src/http/http_common.h`
- Extended `StreamLoadContext` with `prepared_timeout_second` field
- Modified `TransactionMgr` to parse `prepared_timeout` HTTP header
- Updated `StreamLoadExecutor::prepare_txn` to pass timeout to FE
- Enhanced `TransactionState` with `preparedTimeoutMs` field and timeout detection logic
- Updated Thrift interface `TLoadTxnCommitRequest` with `prepared_timeout_second` field

**Frontend Changes:**
- Modified `TransactionLoadAction` to parse `prepared_timeout` parameter
- Updated `TransactionState` with `setPreparedTimeAndTimeout` method
- Enhanced `DatabaseTransactionMgr` and `GlobalTransactionMgr` to handle prepared timeout
- Updated transaction timeout detection logic in `TransactionState::isTimeout`

**Usage Example:**
```bash
# Begin transaction
curl --location-trusted -u root: -H "label:test_txn" -H "timeout:300" -H "db:test_db" -H "table:test_table" \
    -XPOST http://fe_host:8030/api/transaction/begin

# Load data
curl --location-trusted -u root: -H "label:test_txn" -H "db:test_db" -H "table:test_table" \
    -d '1' -XPUT http://fe_host:8030/api/transaction/load

# Prepare transaction with custom timeout (60 seconds)
curl --location-trusted -u root: -H "label:test_txn" -H "db:test_db" \
    -H "prepared_timeout:60" -XPOST http://fe_host:8030/api/transaction/prepare

# Commit transaction
curl --location-trusted -u root: -H "label:test_txn" -H "db:test_db" \
    -XPOST http://fe_host:8030/api/transaction/commit

# View transaction details including PreparedTime and PreparedTimeoutMs
SHOW TRANSACTION WHERE id = <transaction_id>;
+---------------+----------+---------------+-------------------+-------------------+---------------------+---------------------+---------------------+---------------------+---------------------+--------+--------------------+------------+-----------+-------------------+--------+
| TransactionId | Label    | Coordinator   | TransactionStatus | LoadJobSourceType | PrepareTime         | PreparedTime        | CommitTime          | PublishTime         | FinishTime          | Reason | ErrorReplicasCount | ListenerId | TimeoutMs | PreparedTimeoutMs | ErrMsg |
+---------------+----------+---------------+-------------------+-------------------+---------------------+---------------------+---------------------+---------------------+---------------------+--------+--------------------+------------+-----------+-------------------+--------+
| 1633          | test_txn | BE: 127.0.0.1 | VISIBLE           | BACKEND_STREAMING | 2025-08-03 11:02:54 | 2025-08-03 11:03:10 | 2025-08-03 11:03:14 | 2025-08-03 11:03:14 | 2025-08-03 11:03:14 |        | 0                  | [12237]    | 300000    | 60000             |        |
+---------------+----------+---------------+-------------------+-------------------+---------------------+---------------------+---------------------+---------------------+---------------------+--------+--------------------+------------+-----------+-------------------+--------+
```

**Documentation:**
- Updated `Stream_Load_transaction_interface.md` with `prepared_timeout` usage instructions
- Modified `SHOW_TRANSACTION.md` to document new `PreparedTime` and `PreparedTimeoutMs` fields
- Added version information indicating support from 4.0.0 onwards

The feature provides backward compatibility by using the FE configuration `prepared_transaction_default_timeout_second` as the default value when `prepared_timeout` is not specified.
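
The fallback behavior described above can be sketched in Python as follows. This is an illustration only, not the FE or BE code: the function names are hypothetical, the default of 86400 seconds is a placeholder rather than the real value of `prepared_transaction_default_timeout_second`, and the "elapsed time exceeds timeout" check is an assumed reading of the timeout detection logic.

```python
from typing import Optional

# Placeholder standing in for the FE configuration
# prepared_transaction_default_timeout_second (real default not stated here).
DEFAULT_PREPARED_TIMEOUT_SECOND = 86400


def effective_prepared_timeout_ms(
    header_value: Optional[str],
    default_second: int = DEFAULT_PREPARED_TIMEOUT_SECOND,
) -> int:
    """Resolve the prepared timeout in milliseconds (hypothetical helper).

    Uses the prepared_timeout header when present, otherwise the default.
    """
    if header_value is None or header_value.strip() == "":
        return default_second * 1000
    return int(header_value) * 1000


def is_prepared_timed_out(prepared_time_ms: int, now_ms: int, timeout_ms: int) -> bool:
    """Assumed check: a PREPARED transaction times out once the elapsed
    time since it was prepared exceeds the timeout."""
    return now_ms - prepared_time_ms > timeout_ms


if __name__ == "__main__":
    # Matches the usage example: prepared_timeout:60 -> PreparedTimeoutMs 60000
    print(effective_prepared_timeout_ms("60"))
    print(effective_prepared_timeout_ms(None))
```

With the header `prepared_timeout:60`, the sketch yields 60000 ms, consistent with the `PreparedTimeoutMs` column in the usage example.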

Signed-off-by: PengFei Li <lpengfei2016@gmail.com>
Signed-off-by: 絵空事スピリット <wanglichen@starrocks.com>
Co-authored-by: 絵空事スピリット <wanglichen@starrocks.com>
2025-08-08 09:55:08 +08:00
wyb 0bca11047d
[Enhancement] Reorder tablet_balanced column of partitions_meta system table for better compatibility (#61665)
Signed-off-by: wyb <wybb86@gmail.com>
2025-08-07 18:52:11 +08:00
yan zhang d70d3294f6
[UT] fix query id not found when cancelled (#61667)
Signed-off-by: yan zhang <dirtysalt1987@gmail.com>
2025-08-07 06:54:55 +00:00
Murphy 0a9618db4c
[Feature] implement json_remove (#61394)
Signed-off-by: Murphy <mofei@starrocks.com>
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-08-07 14:50:13 +08:00
stdpain e338f7ac00
[BugFix] Fix min/max by crash when process literal inputs (#61651)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-07 14:43:05 +08:00
before-Sunrise 5d59303970
[BugFix] Revert 'Avoid brpc communication when using local pass through' (#61631)
Signed-off-by: before-Sunrise <unclejyj@gmail.com>
2025-08-07 14:26:33 +08:00
Murphy 7dc4d83de8
[Enhancement] FlatJSON-V2: part fe (#61598) 2025-08-07 14:22:43 +08:00
liubotao 44c52750a2
[Enhancement] Shared Data Mode Support Flat Json parameters table-level configs (#61160)
Signed-off-by: liubotao <316945435@qq.com>
2025-08-07 14:08:47 +08:00
zombee0 27b0ef7a22
[Enhancement]local-exchange support bucket-aware execution (#61592)
Signed-off-by: zombee0 <ewang2027@gmail.com>
2025-08-07 11:20:24 +08:00
wyb 3caf4c6f5d
[Enhancement] Show tablet distribution balance statistic (#61549)
Signed-off-by: wyb <wybb86@gmail.com>
2025-08-07 10:57:50 +08:00
srlch 3e1ff80062
[Enhancement] Support System table for dynamic tablet jobs (#61152)
Signed-off-by: srlch <linzichao@starrocks.com>
2025-08-07 09:54:32 +08:00
Xie Bofan 7557647309
[Enhancement] Add header setting support to http_client (#61621)
Signed-off-by: xiebofan <1814739992@qq.com>
2025-08-06 07:25:12 +00:00
Yixin Luo 35355c82cb
[Enhancement] Sort while constructing the mapping from source file rowids to update file rowids when partial update (#61488)
Signed-off-by: luohaha <18810541851@163.com>
2025-08-06 12:48:59 +08:00
stdpain da27736352
[BugFix] Fix maxmin_by window function primitive type output cause unmatched length chunk (#61580)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-06 09:49:00 +08:00
yan zhang d8ff13b53b
[BugFix] fix query hang because of incorrect scan range delivery (#61562)
Signed-off-by: yan zhang <dirtysalt1987@gmail.com>
2025-08-05 10:35:14 +00:00
stdpain a1921ff837
[BugFix] notify should be called after setting streaming_all_states (#61591)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-05 18:23:41 +08:00
zombee0 f9d77d4014
[Enhancement]exchange node support bucket aware execution (#61554)
Signed-off-by: zombee0 <ewang2027@gmail.com>
2025-08-05 15:39:57 +08:00
Mathieu Baurin a5dbbc17cd
[Feature] Add format_bytes function for human-readable byte formatting (#61535)
Signed-off-by: Mathieu Baurin <1mathieu.baurin@gmail.com>
Signed-off-by: stdpain <34912776+stdpain@users.noreply.github.com>
Co-authored-by: stdpain <34912776+stdpain@users.noreply.github.com>
2025-08-05 12:07:38 +08:00
Murphy e88244ec37
[Enhancement] FlatJSON-V2 part 1: BE code (#61447)
Signed-off-by: Murphy <mofei@starrocks.com>
2025-08-04 19:21:22 +08:00
qingzhongli 7df950e7d8
[UT] Fix sse_memcmp UT compilation error on aarch64 (#61569)
Fix sse_memcmp UT compilation error on aarch64.

## Why I'm doing:
```
[ 96%] Building CXX object test/CMakeFiles/starrocks_test_objs.dir/util/monotime_test.cpp.o
[ 96%] Building CXX object test/CMakeFiles/starrocks_test_objs.dir/util/mysql_row_buffer_test.cpp.o
/root/starrocks/be/test/util/memcmp_test.cpp: In member function 'virtual void starrocks::sse_memcmp_Test_Test::TestBody()':
/root/starrocks/be/test/util/memcmp_test.cpp:38:20: error: 'sse_memcmp2' was not declared in this scope
   38 |         int res2 = sse_memcmp2(c1, c2, 3);
      |                    ^~~~~~~~~~~
/root/starrocks/be/test/util/memcmp_test.cpp:46:20: error: 'sse_memcmp2' was not declared in this scope
   46 |         int res2 = sse_memcmp2(c1, c2, 3);
      |                    ^~~~~~~~~~~
/root/starrocks/be/test/util/memcmp_test.cpp:54:20: error: 'sse_memcmp2' was not declared in this scope
   54 |         int res2 = sse_memcmp2(c1, c2, 3);
      |                    ^~~~~~~~~~~
/root/starrocks/be/test/util/memcmp_test.cpp:62:20: error: 'sse_memcmp2' was not declared in this scope
   62 |         int res2 = sse_memcmp2(c1, c2, 3);
      |                    ^~~~~~~~~~~
/root/starrocks/be/test/util/memcmp_test.cpp:71:20: error: 'sse_memcmp2' was not declared in this scope
   71 |         int res2 = sse_memcmp2(c1, c2, 3);
      |                    ^~~~~~~~~~~
/root/starrocks/be/test/util/memcmp_test.cpp:80:20: error: 'sse_memcmp2' was not declared in this scope
   80 |         int res2 = sse_memcmp2(c1, c2, 3);
      |                    ^~~~~~~~~~~
/root/starrocks/be/test/util/memcmp_test.cpp:89:20: error: 'sse_memcmp2' was not declared in this scope
   89 |         int res2 = sse_memcmp2(c1, c2, 3);
      |                    ^~~~~~~~~~~
/root/starrocks/be/test/util/memcmp_test.cpp:98:20: error: 'sse_memcmp2' was not declared in this scope
   98 |         int res2 = sse_memcmp2(c1, c2, 3);
      |                    ^~~~~~~~~~~
make[2]: *** [test/CMakeFiles/starrocks_test_objs.dir/util/memcmp_test.cpp.o] Error 1
make[2]: *** Waiting for unfinished jobs....
make[1]: *** [test/CMakeFiles/starrocks_test_objs.dir/all] Error 2
make: *** [all] Error 2
```

Signed-off-by: qingzhongli <qingzhongli2018@gmail.com>
2025-08-04 15:45:17 +08:00
Yixin Luo 7dac2090e1
[Enhancement] reuse I/O when reading bundled tablet meta (#61413)
Signed-off-by: luohaha <18810541851@163.com>
2025-08-04 10:47:34 +08:00
Yixin Luo 75854adf72
[Enhancement] optimize tablet meta copy when enable file bundling (#61410)
Signed-off-by: luohaha <18810541851@163.com>
2025-08-04 10:47:21 +08:00
Kevin Cai a3a0a01140
[BugFix] fix file-prefix-map, remove the build_XXX part (#61540)
* -ffile-prefix-map=/build/starrocks/be/build_RELEASE=. -ffile-prefix-map=/build/starrocks/be=be
* before this fix, the source file list looks like:
```
  be/build_RELEASE/be/src/agent/agent_common.h
  be/build_RELEASE/be/src/agent/agent_server.cpp
  ...
  be/build_RELEASE/be/src/util/value_generator.h
  be/build_RELEASE/be/src/util/xxhash.h
``` 
* after this fix:
```
 ./be/src/agent/agent_common.h
 ./be/src/agent/agent_server.cpp
 ...
 ./be/src/util/value_generator.h
 ./be/src/util/xxhash.h
```
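
The effect of the two `-ffile-prefix-map` rules above can be mimicked with a short Python sketch. It is a simplification, not the compiler's algorithm: it picks the longest matching old prefix, whereas a real compiler's precedence among several matching options may depend on option order. The absolute input path is inferred from the before/after listings and is an assumption, and `remap` is a hypothetical helper.

```python
from typing import List, Tuple

# (old_prefix, new_prefix) pairs taken from the flags in this commit message.
RULES: List[Tuple[str, str]] = [
    ("/build/starrocks/be/build_RELEASE", "."),
    ("/build/starrocks/be", "be"),
]


def remap(path: str, rules: List[Tuple[str, str]]) -> str:
    """Replace the longest matching old prefix with its new prefix.

    Longest-match is an assumption of this sketch; paths with no
    matching rule are returned unchanged.
    """
    best = None
    for old, new in rules:
        if path.startswith(old) and (best is None or len(old) > len(best[0])):
            best = (old, new)
    if best is None:
        return path
    return best[1] + path[len(best[0]):]


if __name__ == "__main__":
    # Assumed absolute path of a source file as seen during the build.
    src = "/build/starrocks/be/build_RELEASE/be/src/agent/agent_common.h"
    print(remap(src, RULES[1:]))  # only the broader rule applies
    print(remap(src, RULES))      # the build_RELEASE rule takes effect
```

With only the broader rule, the `build_RELEASE` segment stays in the recorded path; once the more specific rule applies, the path starts at `./be/src/`, which mirrors the before and after listings.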

Signed-off-by: Kevin Cai <kevin.cai@celerdata.com>
2025-08-04 09:11:24 +08:00
zhangqiang d89a2f64f4
[Refactor] Change the data type of data_size column in the partitions_meta table to bigint. (#61251)
Signed-off-by: sevev <qiangzh95@gmail.com>
2025-08-02 11:32:38 +08:00
Kevin Cai 45c2310372
[BugFix] don't try to build MFV versions for the instructions turned off (#61532)
A CPU instruction set may be switched off either because the target instruction set is not wanted or because the build machine does not support it. Respect the instruction switch in both cases.

Signed-off-by: Kevin Cai <kevin.cai@celerdata.com>
2025-08-02 11:02:34 +08:00
stdpain 94726f0973
[BugFix] Fix UAF in local-partition preagg (#61524)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-08-01 09:37:18 +00:00
stdpain 14fca55647
[BugFix] Fix local-passthrough cancel dead lock (#61487)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-07-31 20:02:39 +08:00
Kevin Cai ef04362a2f
[BugFix] properly handle orc reader decompress error (#61464)
Handle zstd decompression failures by throwing a runtime_error exception.
The orc_scanner test file tpch_10k.orc.zstd was corrupted. Replace it with a correct test file and update the related test cases.

Signed-off-by: Kevin Cai <kevin.cai@celerdata.com>
2025-07-31 16:35:18 +08:00
SevenJ a362c009bc
[UT] Fix iceberg trans ut (#61459)
Signed-off-by: SevenJ <wenjun7j@gmail.com>
2025-07-31 14:38:18 +08:00
stdpain 65fd661601
[Enhancement] add vectorized implementation for assign_data_with_nulls (#61454)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-07-31 14:25:52 +08:00
zombee0 ee8bea1c33
[Enhancement]murmur3 hash to do bucket aware execution for iceberg (#61366)
Signed-off-by: zombee0 <ewang2027@gmail.com>
2025-07-31 10:05:17 +08:00
stdpain fce0346e97
[BugFix] Fix local-passthrough cause rpc to get stuck (#61427)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-07-30 16:21:23 +08:00
Fatih Çatalkaya 75b996b714
[BugFix] Set Content-Type to application/json when responding to stream load http requests (#61144)
Why I'm doing:
When sending a request to the /api/transaction/{begin,load,commit,...} endpoints, the content type is wrongly set to text/html instead of application/json.

What I'm doing:
Fixes #61130

Signed-off-by: Fatih Çatalkaya <fatih.catalkaya@yahoo.de>
Signed-off-by: Kevin Cai <kevin.cai@celerdata.com>
Co-authored-by: Kevin Cai <kevin.cai@celerdata.com>
2025-07-30 08:47:21 +08:00
Yixin Luo 768e03ec5e
[Enhancement] add idle time config for publish version thread pool (#61239)
Signed-off-by: luohaha <18810541851@163.com>
Signed-off-by: Yixin Luo <luoyixin6688@gmail.com>
Co-authored-by: 絵空事スピリット <wanglichen@starrocks.com>
2025-07-29 16:40:52 +00:00
Murphy af49488e6f
[UT] Fuzz test built-in functions with type coverage (#61303)
Signed-off-by: Murphy <mofei@starrocks.com>
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-07-29 11:20:47 +00:00
Kevin Cai 942a77c5bb
[UT] fix incorrect use of evhttp in stream load unit test (#61390)
The test body should not create a separate evhttp_request.
It should instead use the input_buffer created in the evhttp_request initialized by evhttp_request_new().

Signed-off-by: Kevin Cai <kevin.cai@celerdata.com>
2025-07-29 18:43:01 +08:00
stdpain 04001e8618
[Enhancement] some minor optimizations for reading parquet files (#60551)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-07-29 18:12:53 +08:00
zhangqiang 2defcf0579
[BugFix] Fix nullptr exception during clone. (#61359)
Why I'm doing:
When clone and drop table run concurrently, the new_tablet used during clone may be dropped, which causes a null pointer exception.

Signed-off-by: sevev <qiangzh95@gmail.com>
2025-07-29 16:26:44 +08:00
Yixin Luo 0f1deef421
[BugFix] fix missing partition id in combine txnlog (#61207)
Signed-off-by: luohaha <18810541851@163.com>
2025-07-29 16:25:12 +08:00
Murphy c9ea6464fe
[BugFix] compile failure in clang (#61351)
Signed-off-by: Murphy <mofei@starrocks.com>
2025-07-29 13:07:50 +08:00
stdpain 70a7f618d5
[Refactor] Refactor scalar function registration to speed up compilation (#61358)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-07-29 09:45:15 +08:00
yan zhang d46937ed5c
[UT] Fix compilation and be-ut (#61347)
Signed-off-by: yan zhang <dirtysalt1987@gmail.com>
2025-07-29 09:21:40 +08:00
Hechem Selmi b7c2561dc0
[Enhancement] Avoid brpc communication when using local pass through (#60538)
Signed-off-by: m-selmi <m.selmi@celonis.com>
Signed-off-by: stdpain <drfeng08@gmail.com>
Co-authored-by: stdpain <drfeng08@gmail.com>
2025-07-28 11:08:19 +00:00
shuming.li ac5fc3f681
[UT] [BugFix] Fix unstable JITCacheTest tests (#61331) 2025-07-28 18:47:39 +08:00
starrocks-xupeng b0f5cbbbb1
[Enhancement] add segment write time in lake compaction (#60891)
Signed-off-by: starrocks-xupeng <xupeng@starrocks.com>
2025-07-28 17:35:11 +08:00
yan zhang b84d2051e4
[UT] disable parquet asan long running ut (#61334)
Signed-off-by: yan zhang <dirtysalt1987@gmail.com>
2025-07-28 15:58:36 +08:00
stdpain 9521bd8266
[Enhancement] Introduce RETURN_IF_DCHECK_XX_FAILED (#61315)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-07-28 14:08:11 +08:00
Jun-Seok Heo 774b9d0de3
[BugFix] fix the pruned column size to be same with the unpruned one (#61271) 2025-07-28 12:06:20 +08:00
stdpain fc856ca330
[BugFix] Fix array_map crash when capture const array columns (#61309)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-07-26 15:54:39 +08:00
duanyyyyyyy 561b82cd93
[BugFix] Fix a bug that agg_state_if will not handle the streaming aggregation cases (#61084)
Signed-off-by: ‘duanyyyyyyy’ <yan.duan9759@gmail.com>
2025-07-26 12:50:47 +08:00
Kevin Cai b5cc684042
[UT] fix StarOSWorker AwsSDK cleanup issue (#61265)
Signed-off-by: Kevin Cai <kevin.cai@celerdata.com>
2025-07-25 13:49:44 +08:00
srlch cbb77d9883
[BugFix] Fix set null value for auto_increment column will reject the valid data if they are in the same chunk (#61255)
Signed-off-by: srlch <linzichao@starrocks.com>
2025-07-25 12:48:28 +08:00
Evgeniy Zuikin 81ff271a80
[BugFix] Fix array column cloning during array comparison (#61036)
Signed-off-by: SHaaD94 <eugenzuy@gmail.com>
Signed-off-by: stdpain <drfeng08@gmail.com>
Signed-off-by: stdpain <34912776+stdpain@users.noreply.github.com>
Co-authored-by: stdpain <drfeng08@gmail.com>
Co-authored-by: stdpain <34912776+stdpain@users.noreply.github.com>
2025-07-25 11:06:42 +08:00
Murphy 2b69350d1b
[BugFix] fix hour_from_unixtime (#61206)
Signed-off-by: Murphy <mofei@starrocks.com>
2025-07-25 10:10:14 +08:00
Gavin 46601e16e4
[Enhancement] Disable the inline mode when writing data to datacache because it may cause a performance degradation. (#60530)
Signed-off-by: GavinMar <yangguansuo@starrocks.com>
2025-07-24 17:27:34 +08:00
eyes_on_me 4167aaf940
[BugFix] fix TableMetricsMgrTest (#61218)
Signed-off-by: silverbullet233 <3675229+silverbullet233@users.noreply.github.com>
2025-07-24 13:56:44 +08:00
zihe.liu 3107899823
[BugFix] Fix resource group cpu usage (#61177)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-07-23 19:37:10 +08:00
eyes_on_me d71cc3d2c7
[BugFix] reduce lock contention of TableMetricsManager (#58911)
Signed-off-by: silverbullet233 <3675229+silverbullet233@users.noreply.github.com>
2025-07-23 19:15:07 +08:00
eyes_on_me 6abb89573c
[BugFix] make scan behavior consistent on shared-data and shared-nothing (#61100)
Signed-off-by: silverbullet233 <3675229+silverbullet233@users.noreply.github.com>
2025-07-23 10:24:59 +08:00
satanson e91696fa1b
[BugFix] excluding some files involving JIT when STARROCKS_JIT_ENABLE=OFF (#61138)
Signed-off-by: satanson <ranpanf@gmail.com>
2025-07-22 16:13:37 +08:00
zihe.liu 2144db870c
[Enhancement] Use RangeDirectMapping to optimize hash join (#61124)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-07-22 15:55:54 +08:00
alexzorin 2dbfc1d516
[BugFix] set hit_count in vector index metrics (#61102)
Signed-off-by: Alex Zorin <alex@zorin.au>
2025-07-22 14:41:56 +08:00
starrocks-xupeng f3144b9a2e
[BugFix] fix cache might not be used when upgraded from 3.3 (#60973)
Signed-off-by: starrocks-xupeng <xupeng@starrocks.com>
2025-07-22 14:32:30 +08:00
stdpain 78558bcc07
[BugFix] Fix dictionary inconsistency in shared-data mode (#61006)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-07-22 14:21:27 +08:00
stdpain ebd73ed42c
[Enhancement] avoid reuse ByteBuffer when merge data in JAVA UDAF (#61054)
Signed-off-by: stdpain <drfeng08@gmail.com>
2025-07-21 15:39:22 +08:00
zihe.liu c2d4734377
[Refactor] Split join_hash_map into files (#61010)
Signed-off-by: zihe.liu <ziheliu1024@gmail.com>
2025-07-21 14:04:11 +08:00
srlch 4ac5ae833f
[Enhancement] Filter out keys using SstablePredicate for sstable after compaction (#60743)
Signed-off-by: srlch <linzichao@starrocks.com>
2025-07-21 09:33:53 +08:00
satanson f877782f08
[BugFix] Executable segments generated by JIT are not released when it is evicted from JIT cache (#61027)
Signed-off-by: satanson <ranpanf@gmail.com>
2025-07-18 16:43:09 +08:00
satanson b26637e0f5
[BugFix] disable jit in BE (#61060)
Signed-off-by: satanson <ranpanf@gmail.com>
2025-07-18 11:40:06 +08:00