Skip to content

Latest commit

 

History

History
555 lines (443 loc) · 35.4 KB

CHANGELOG.md

File metadata and controls

555 lines (443 loc) · 35.4 KB

Changelog

0.7.2 - 2023-02-17

Features

  • [Alpha] Support the new SQL clause WITH (#2846 @aceforeverd)
  • Support deploying multiple TaskManagers (#3004 @zhanghaohit)
  • Support the new built-in functions std, stddev, stddev_samp, stddev_pop, ew_avg and drawdown (#3025 #3032 #3029 @zhanghaohit)
  • Add the new configurations to specify the maximum size of RocksDB's log files (#2991 @dl239)
  • The CREATE TABLE ... LIKE PARQUET ... statement supports a parquet file as the input in the offline mode. (#2996 @tobegit3hub)
  • Support showing query results of synchronous jobs in TaskManager (#3034 @vagetablechicken)
  • Change the default timeout of synchronous jobs to 30 minutes, and add a corresponding CLI parameter for configuration (#3061 @vagetablechicken)
  • Improve the documents (#2938 #2984 #3016 @vagetablechicken, #2958 #2973 #2980 #2987 #2988 #3035 @lumianph, #2990 @lukeAyin, #2997 #3065 @tobegit3hub, #3011 #3027 @dl239, #3020 #3066 #3071 #3074 @aceforeverd, #3033 #3036 @selenachenjingxin)

Bug Fixes

  • Disk table does not clean the expired data. (#2963 @dl239)
  • Incorrect index will be added if there is LAST JOIN statement in a deployed SQL. (#2979 @dl239)
  • The result is incorrect if a window frame is specified by EXCLUDE CURRENT_ROW (#2930 @aceforeverd)
  • SQL compiling fails if there is an UDF function in an UDAF expression. (#3018 @aceforeverd)
  • Although the return information indicates success, index creation may still fail in some cases. (#3042 @vagetablechicken)
  • The recoverdata command fails if there are a large number of records in a memory table. (#3060 @dl239)
  • The deploy-all tool deploys the Spark package to local nodes only. (#3022 @zhanghaohit)
  • Other minor bug fixes (#2970 #3028 #3026 #3003 #3064 @dl239)

Code Refactoring

#2995 #3030 @aceforeverd

Note:
While we have resolved the overflow issue in the current version of the monitor component #3003, it may still persist when upgrading from an older version.

0.7.1 - 2023-01-13

Features

  • Support data import from Hive using a symbolic link (#2948 @vagetablechicken)
  • Support the new SQL statement CREATE TABLE LIKE (#2949 @aceforeverd, #2962 @tobegit3hub)
  • Improve the non-interactive CLI (#2898 @vagetablechicken)
  • Improve the documents (#2904 #2921 #2932 #2942 @selenachenjingxin, #2925 #2928 #2934 #2954 @vagetablechicken, @2924 @dl239, #2945 #2952 @lumianph, #2946 @aceforeverd)

Bug Fixes

  • The result of _*_cate is incorrect. (#2939 @zhanghaohit)
  • The deployment of SQL fails if the column name of a major table is a keyword. (#2894 @dl239)
  • Tablet may core dump when executing SQLs with disk tables. (#2926 @dl239)
  • There is memory leak when writing data into disk tables. (#2943 @dl239)
  • The result of show components is incorrect in certain cases. (#2940 @dl239)
  • Offline jobs execution fails in certain cases because the Curator component causes an incompatible issue. (#2936 @tobegit3hub)
  • Disabling the monitor log (#2953 @dl239)

Code Refactoring

#2875 #2937 @dl239

0.7.0 - 2022-12-30

Features

  • Improve the messages and errors when inserting rows (#2834 @vagetablechicken)
  • Add a new configuration max_memory to limit the memory usage of a tablet (#2815 @dl239)
  • Add new maintenance tools deploy-all and start-all (#2809 @zhanghaohit)
  • Insertion returns errors if the value of timestamp field is negative (#2776 @dl239)
  • Support the new built-in functions unix_timestamp, pmod, datediff, and size (#2843 #2839 #2847 #2864 @zhanghaohit)
  • Add the new data type of ARRAY in UDFs (#2817 @aceforeverd)
  • Improve the documents (#2868 @haseeb-xd, #2878 @Jake-00, #2879 #2876 #2889 #2890 @vagetablechicken, #2881 #2907 #2908 #2897 @selenachenjingxin, #2859 @AdvancedUno, #2893 @lumianph, #2916 @aceforeverd)

Bug Fixes

  • Window over a subquery(t1 LAST JOIN t2) fails due to column renaming in the subquery. (#2739 @aceforeverd)
  • SHOW JOBLOG fails under certain circumstances. (#2874 @tobegit3hub)
  • OP is not deleted if the related table has been dropped. (#2548 @dl239)
  • Memory is not released when deleting an index in some cases. (#2806 @dl239)
  • Changing a leader to a specified endpoint fails if there are data writing. (#2858 @dl239)
  • UDFs do not work for yarn-client and yarn-cluster in the offline mode. (#2802 @tobegit3hub)
  • Other minor bug fixes (#2828 #2903 #2906 @vagetablechicken, #2867 #2912 @dl239)

Code Refactoring

#2860 @mammar11, #2865 #2863 @vagetablechicken, #2861 #2862 #2871 @Ziy1-Tan

0.6.9 - 2022-12-08

Features

  • Add pre-upgrade and post-upgrade options in tools for upgrade (#2761 @zhanghaohit)
  • Support starting OpenMLDB with daemon (#2833 @vagetablechicken)
  • Improve the documents (#2827 @tobegit3hub, #2838 @vagetablechicken)

Bug Fixes

  • The CLI may crash if executing show job without a job ID. (#2771 @aceforeverd)
  • select count(*) may return empty result after tablets restart. (#2835 @zhanghaohit)
  • A tablet may crash if the output of SQL engine runner is null. (#2831 @dl239)

0.6.8 - 2022-11-29

Features

  • Support the where clause in the SQL batch engine (#2820 @tobegit3hub)
  • Support input and output with the JSON format in APIServer (#2813 @vagetablechicken)
  • Improve the documents (#2814 @vagetablechicken)

Code Refactoring

#2816 @dl239, #2714 @aceforeverd

0.6.7 - 2022-11-22

Features

  • Support importing and exporting data from/to Hive (@2778 @vagetablechicken)
  • Improve the module of autofe (#2777 @vagetablechicken)
  • Improve error messages of the TaskManager client (#2780 @vagetablechicken)
  • Improve the documents (#2781 @zhanghaohit, #2767 #2792 @vagetablechicken, #2805 @selenachenjingxin, #2810 @dl239)

Bug Fixes

  • Python SDK workflow may fail on MacOS. (#2783 @vagetablechicken, #2788 @dl239)
  • There are syntax errors in some log messages. (@2770 dl239)
  • Installing Python SDK requires unnessary packages. (#2791 @vagetablechicken)

0.6.6 - 2022-11-14

Features

  • Support the new build-in function hash64 (#2754 @aceforeverd)
  • Improve the documents (#2763 @dl239, #2610 #2606 @michelle-qinqin)

Bug Fixes

  • pytest command is not found in the MacOS virtual machine. (#2765 @tobegit3hub)
  • Wrong output schema passes to the WindowAggRunner. (#2758 @aceforeverd)
  • There are no outputs when executing showopstatus command if no database is specified (#2773 @dl239)
  • The data recovery tool fails in some cases (#2768 @dl239)

0.6.5 - 2022-11-04

Features

  • Optimize the distribution of table partitions (#2701 @jkpjkpjkp)
  • Add a new workflow to generate the documents of built-in functions automatically (#2709 #2729 @aceforeverd)
  • Support the new SQL statement show joblog (#2732 @aceforeverd, #2747 @tobegit3hub)
  • Add a warning message for show table status (#2738 @zhanghaohit)
  • Add a new tool for data recovery and scale-out/scale-in (#2736 @dl239)
  • Improve the documents (#2707 #2727 @aceforeverd, #2718 #2538 #2731 #2752 @vagetablechicken, #2607 #2609 @michelle-qinqin, #2733 @zhanghaohit, #2742 @auula)

Bug Fixes

  • Incorrect data will be loaded in offline mode if the schema mismatches with parquet files. (#2648 @vagetablechicken)
  • Creating index fails if specifying a database in SQL statement (#2720 @dl239)
  • start_time is not human-readable after submitting a job (#2751 @tobegit3hub)
  • Incorrect result of GetRecordIdxCnt is produced in MemTable (#2719 @jkpjkpjkp)

Code Refactoring

#2688 #2717 @vagetablechicken, #2705 #2728 @dl239, #2601 @team-317, #2737 @Jake-00

0.6.4 - 2022-10-21

Features

  • Support a new series of built-in functions top_n_value_*_cate_where (#2622 @aceforeverd)
  • Support online batch computation and aggregation over a full table (#2620 @zhanghaohit)
  • Support load_mode and thread options for LOAD DATA (#2684 @zhanghaohit)
  • Improve the documents (#2476, #2486 #2514 #2611 #2693 #2462 @michelle-qinqin, #2695 @lumianph, #2653 @vagetablechicken)
  • Support running MacOS compiling jobs in the CICD workflow (#2665 @dl239)

Bug Fixes

  • Recreating index fails if it has been dropped. (#2440 @dl239)
  • The traverse method may get duplicate data if there are same ts records on one pk (#2637 @dl239)
  • Multiple window union will fail when compiling in batch mode (#2478 @tobegit3hub, #2561 @aceforeverd)
  • select * ... statement may cause inconsistent output schemas in many cases (#2660 @aceforeverd)
  • Result is incorrect if the window is specified as UNBOUNDED PRECEDING AND CURRENT ROW EXCLUDE CURRENT_ROW (#2674 @aceforeverd)
  • Incorrect slice offsets may lead to offline jobs hang (#2687 @aceforeverd)
  • Other minor bug fixes (#2669 @dl239, #2683 @zhanghaohit)

Code Refactoring

#2541 @dl239, #2573 #2672 @haseeb-xd

0.6.3 - 2022-10-14

Features

  • Support setting the configuration of glog for clients (#2482 @vagetablechicken)
  • Add the checksum of SHA256 for release packages (#2560 @team-317)
  • Support the new build-in function unhex (#2431 @aucker)
  • Support the readable date and time format in CLI (#2568 @dl239)
  • Support the LAST JOIN with a subquery as a producer of window node in the request mode (#2569 @aceforeverd)
  • Upgrade the Spark version to 3.2.1 (#2566 @tobegit3hub, #2635 @dl239)
  • Support setting the SQL cache size in SDKs (#2605 @vagetablechicken)
  • Add a new interface of ValidateSQL to validate the syntax of SQL (#2626 @vagetablechicken)
  • Improve the documents (#2405 #2492 #2562 #2496 #2495 #2436 #2487 #2623 @michelle-qinqin, #2543 @linjing-lab, #2584 @JourneyGo, #2567 #2583 @vagetablechicken, #2643 @dl239)
  • Other minor features (#2504 #2572 #2498 #2598 @aceforeverd, #2555 #2641 @tobegit3hub, #2550 @zhanghaohit, #2595 @Elliezza, #2592 @vagetablechicken)

Bug Fixes

  • After a nameserver restarting, deployments may not recover. (#2533 @dl239)
  • If the type of first column is bool, it fails to resolve the function count_where. (#2570 @aceforeverd)
  • Other minor bug fixes (#2540 #2577 #2625 #2655 @dl239, #2585 @snehalsenapati23, #2539 @vagetablechicken)

Code Refactoring

#2516 #2520 #2522 #2521 #2542 #2531 #2581 @haseeb-xd, #2525 #2526 #2527 #2528 @kstrifonoff, #2523 @ighmaZ, #2546 #2549 @NevilleMthw, #2559 @marandabui, #2554 @gokullan, #2580 @team-317, #2599 @lbartyczak, #2594 @shivamgupta-sg, #2571 @Jake-00

0.6.2 - 2022-09-20

Features

  • Support independently executing the OpenMLDB offline engine without the OpenMLDB deployment (#2423 @tobegit3hub)
  • Support the log setting of ZooKeeper and disable ZooKeeper logs in the diagnostic tool (#2451 @vagetablechicken)
  • Support query parameters of the SQL query APIs (#2277 @qsliu2017)
  • Improve the documents (#2406 @aceforeverd, #2408 #2414 @vagetablechicken, #2410 #2402 #2356 #2374 #2396 #2376 #2419 @michelle-qinqin, #2424 #2418 @dl239, #2455 @lumianph, #2458 @tobegit3hub)
  • Other minor features (#2420 @aceforeverd, #2411 @wuyou10206, #2446 #2452 @vagetablechicken, #2475 @tobegit3hub)

Bug Fixes

  • Table creation succeeds even if partitionnum is set to 0, which should report an error. (#2220 @dl239)
  • There are thread races in aggregators if there are concurrent puts. (#2472 @zhanghaohit)
  • The limit clause dose not work if it is used with the where and group by clauses. (#2447 @aceforeverd)
  • The TaskManager process will terminate if ZooKeeper disconnects. (#2494 @tobegit3hub)
  • The replica cluster dose not create the database if a database is created in the leader cluster (#2488 @dl239)
  • When there is data in base tables, deployment with long windows still can be executed (which should report an error). (#2501 @zhanghaohit)
  • Other minor bug fixes (#2415 @aceforeverd, #2417 #2434 #2435 #2473 @dl239, #2466 @vagetablechicken)

Code Refactoring

#2413 @dl239, #2470 #2467 #2468 @vagetablechicken

0.6.1 - 2022-08-30

Features

  • Support new build-in functions last_day and regexp_like (#2262 @HeZean, #2187 @jiang1997)
  • Support Jupyter Notebook for the TalkingData use case (#2354 @vagetablechicken)
  • Add a new API to disable Saprk logs of the batch engine (#2359 @tobegit3hub)
  • Add the use case of precision marketing based on OneFlow (#2267 @Elliezza @vagetablechicken @siqi)
  • Support the RPC request timeout in CLI and Python SDK (#2371 @vagetablechicken)
  • Improve the documents (#2021 @liuceyim, #2348 #2316 #2324 #2361 #2315 #2323 #2355 #2328 #2360 #2378 #2319 #2350 #2395 #2398 @michelle-qinqin, #2373 @njzyfr, #2370 @tobegit3hub, #2367 #2382 #2375 #2401 @vagetablechicken, #2387 #2394 @dl239, #2379 @aceforeverd, #2403 @lumianph, #2400 gitpod-for-oss @aceforeverd, )
  • Other minor features (#2363 @aceforeverd, #2185 @qsliu2017)

Bug Fixes

  • APIServer will core dump if no rs in QueryResp. (#2346 @vagetablechicken)
  • Data has not been deleted from pre-aggr tables if there are delete operations in a main table. (#2300 @zhanghaohit)
  • Task jobs will core dump when enabling UnsafeRowOpt with multiple threads in the Yarn cluster. (#2352 #2364 @tobegit3hub)
  • Other minor bug fixes (#2336 @dl239, #2337 @dl239, #2385 #2372 @aceforeverd, #2383 #2384 @vagetablechicken)

Code Refactoring

#2310 @hv789, #2306 #2305 @yeya24, #2311 @Mattt47, #2368 @TBCCC, #2391 @PrajwalBorkar, #2392 @zahyaah, #2405 @wang-jiahua

0.6.0 - 2022-08-10

Highlights

  • Add a new toolkit of managing OpenMLDB, currently including a diagnostic tool and a log collector (#2299 #2326 @dl239 @vagetablechicken)
  • Support aggregate functions with suffix _where using pre-aggregation (#1821 #1841 #2321 #2255 #2321 @aceforeverd @nautaa @zhanghaohit)
  • Support a new SQL syntax of EXCLUDE CURRENT_ROW (#2053 #2165 #2278 @aceforeverd)
  • Add new OpenMLDB ecosystem plugins for DolphinScheduler (#1921 #1955 @vagetablechicken) and Airflow (#2215 @vagetablechicken)

Other Features

  • Support SQL syntax of DELETE in SQL and Kafka Connector (#2183 #2257 @dl239)
  • Support customized order in the insert statement (#2075 @vagetablechicken)
  • Add a new use case of TalkingData AdTracking Fraud Detection (#2008 @vagetablechicken)
  • Improve the startup script to remove mon (#2050 @dl239)
  • Improve the performance of offline batch SQL engine (#1882 #1943 #1973 #2142 #2273 #1773 @tobegit3hub)
  • Support returning version numbers from TaskManager (#2102 @tobegit3hub)
  • Improve the CICD workflow and release procedure (#1873 #2025 #2028 @mangoGoForward)
  • Support GitHub Codespaces (#1922 @nautaa)
  • Support new built-in functions char(int), char_length, character_length, radians, hex, median (#1896 #1895 #1897 #2159 #2030 @wuxiaobai24 @HGZ-20 @Ivyee17)
  • Support returning result set for a new query API (#2189 @qsliu2017)
  • Improve the documents (#1796 #1817 #1818 #2254 #1948 #2227 #2254 #1824 #1829 #1832 #1840 #1842 #1844 #1845 #1848 #1849 #1851 #1858 #1875 #1923 #1925 #1939 #1942 #1945 #1957 #2031 #2054 #2140 #2195 #2304 #2264 #2260 #2257 #2254 #2247 #2240 #2227 #2115 #2126 #2116 #2154 #2152 #2178 #2147 #2146 #2184 #2138 #2145 #2160 #2197 #2198 #2133 #2224 #2223 #2222 #2209 #2248 #2244 #2242 #2241 #2226 #2225 #2221 #2219 #2201 #2291 # 2231 #2196 #2297 #2206 #2238 #2270 #2296 #2317 #2065 #2048 #2088 #2331 #1831 #1945 #2118 @ZtXavier @pearfl @PrajwalBorkar @tobegit3hub @ZtXavier @zhouxh19 @dl239 @vagetablechicken @tobegit3hub @aceforeverd @jmoldyvan @lumianph @bxiiiiii @michelle-qinqin @yclchuxue @redundan3y)

Bug Fixes

  • The SQL engine may produce incorrect results under certain circumstances. (#1950 #1997 #2024 @aceforeverd)
  • The genDDL function generates incorrect DDL if the SQL is partitioned by multiple columns. (#1956 @dl239)
  • The snapshot recovery may fail for disk tables. (#2174 @zhanghaohit)
  • enable_trace does not work for some SQL queries. (#2292 @aceforeverd)
  • Tablets cannot save ttl when updating the ttl of index. (#1935 @dl239)
  • MakeResultSet uses a wrong schema in projection. (#2049 @dl239)
  • A table does not exist when deploying SQL by the APIServer (#2205 @vagetablechicken)
  • The cleanup for ZooKeep does not work properly. (#2191 @mangoGoForward)

Other minor bug fixes (#2052 #1959 #2253 #2273 #2288 #1964 #2175 #1938 #1963 #1956 #2171 #2036 #2170 #2236 #1867 #1869 #1900 #2162 #2161 #2173 #2190 #2084 #2085 #2034 #1972 #1408 #1863 #1862 #1919 #2093 #2167 #2073 #1803 #1998 #2000 #2012 #2055 #2174 #2036 @Xeonacid @CuriousCorrelation @Shigm1026 @jiang1997 @Harshvardhantomar @nautaa @Ivyee17 @frazie @PrajwalBorkar @dl239 @aceforeverd @tobegit3hub @dl239 @vagetablechicken @zhanghaohit @mangoGoForward @SaumyaBhushan @BrokenArrow1404 @harshlancer)

Code Refactoring

#1884 #1917 #1953 #1965 #2017 #2033 #2044 @mangoGoForward; #2131 #2130 #2112 #2113 #2104 #2107 #2094 #2068 #2071 #2070 #1982 #1878 @PrajwalBorkar; #2158 #2051 #2037 #2015 #1886 #1857 @frazie; #2100 #2096 @KikiDotPy; #2089 @ayushclashroyale; #1994 @fpetrakov; #2079 kayverly; #2062 @WUBBBB; #1843 @1korenn; #2092 @HeZean; #1984 @0sirusD3m0n; #1976 @Jaguar16; #2086 @marc-marcos; #1999 @Albert-Debbarma;

0.5.3 - 2022-07-22

Bug Fixes

  • The SQL file cannot be successfully loaded in the Yarn-Client mode. (#2151 @tobegit3hub)
  • The SQL file cannot be successfully loaded in the Yarn-Cluster mode. (#1993 @tobegit3hub)

0.5.2 - 2022-06-10

Features

  • Add new built-in functions, including char_length, char, radians, and replace (#1895 #1896 #1897 @Ivyee17, #1924 @aceforeverd)
  • Add the demo of DolphinScheduler task (#1921 @vagetablechicken)
  • Support inserting values with a specified database name (#1929 @dl239)
  • Improve window computation with UnsafeRowOpt by removing the zipped dataframe (#1882 @tobegit3hub)
  • Improve the documents (#1831 @yclchuxue, #1925 @lumianph, #1902 #1923 @vagetablechicken)
  • Support GitHub Codespaces (#1922 @nautaa)

Bug Fixes

  • DistributeWindowIterator::GetKey() may result in core dump (#1892 aceforeverd)
  • Tablet does not make ttl persistent when updating the ttl of index (#1935 @dl239)
  • TaskManager startup fails if LANG=zh_CN.UTF-8 is set (#1912 @vagetablechicken)
  • There are duplicate records in PRE_AGG_META_INFO (#1919 @nautaa)
  • The OpenMLDB Spark fails to fallback to SparkSQL for unsupported functions (#1908 @tobegit3hub)
  • Fixing other minor bugs (#1914 aceforeverd, #1900 @mangoGoForward, #1934 @vagetablechicken)

Code Refactoring

#1899 @auula, #1913 @dl239, #1917 @mangoGoForward, #1803 @SaumyaBhushan, #1870 @Ivyee17, #1886 @frazie

0.5.1 - 2022-05-26

Features

  • Support the new OpenMLDB Kafka connector (#1771 @vagetablechicken)
  • Support very long SQLs in TaskManager (#1833 @tobegit3hub)
  • Support window union correctly in the cluster mode (#1855 #1856 @aceforeverd @dl239)
  • Support count_where(*, condition) in the storage engine (#1841 @nautaa)
  • Add a new micro-benchmark tool for performance evaluation (#1800 @dl239)

Bug Fixes

  • Auto creating table throws error when a new ttl is greater than the current ttl. (#1737 @keyu813)
  • Offline tasks crash when enabling UnsafeRowOpt for continuous windows. (#1773 @tobegit3hub)
  • The aggregator is not reset if the table is empty. (#1784 @zhanghaohit)
  • The order for window union rows and original rows with the same order key is undefined. (#1802 @aceforeverd)
  • Queries with pre-aggregate enabled may crash under certain tests. (#1838 zhanghaohit)
  • Ending space in CLI may cause program crash. (#1820 @aceforeverd)
  • When creating an engine with empty databases, it cannot execute the command of USE database in the Python SDK. (#1854 @vagetablechicken)
  • When using the soft copy for csv files, it cannot read offline path with options. (#1872 @vagetablechicken)

Code Refactoring

#1766 @hiyoyolumi; #1777 @jmoldyvan; #1779 @SohamRatnaparkhi; #1768 @SaumyaBhushan; #1795 @vighnesh-kadam; #1806 @Mount-Blanc; #1978 @wangxinyu666666; #1781 @SaumyaBhushan; #1786 @xuduling; #1810 @IZUMI-Zu; #1824 @bxiiiiii; #1843 @1korenn; #1851 @zhouxh19; #1862 @Ivyee17; #1867, #1869, #1873, #1884 @mangoGoForward; #1863 @Ivyee17; #1815 @jmoldyvan; #1857 @frazie; #1878 @PrajwalBorkar

0.5.0 - 2022-05-07

Highlights

  • We have introduced an important performance optimization technique of pre-aggregation, which can significantly improve the performance for a query with time windows containing massive amount of rows, e.g., a few millions. (#1532 #1573 #1583 #1622 #1627 #1672 # 1712 @zhanghaohit @nautaa)
  • We have added a new storage engine that supports persistent storage (such as HDD and SSD) for the online SQL engine. Such a storage engine is helpful when a user wants to reduce the cost with acceptable performance degradation. (#1483 @Leowner)
  • We have supported C/C++ based User-Defined Functions (UDFs) with dynamic registration to enhance the development experience. (#1509 #1733 #1700 @dl239 @tobegit3hub)

Other Features

  • Enhance the OpenMLDB Prometheus exporter ( #1584, #1645, #1754 @aceforeverd )
  • Support collecting statistics of query response time for online queries ( #1497, #1521 @aceforeverd )
  • Support new SQL commands: SHOW COMPONENTS, SHOW TABLE STATUS (#1380 #1431 #1704 @aceforeverd)
  • Support setting global variables (#1310 #1359 #1364 @keyu813 @aceforeverd)
  • Support reading Spark configuration files from the CLI (#1600 @tobegit3hub)
  • Support using multiple threads for the Spark local mode (#1675 @tobegit3hub)
  • Enhance the performance of join by using the Spark's native expression (#1502 tobegit3hub)
  • Support the validation for TaskManager configuration (#1262 @tobegit3hub)
  • Support tracking unfinished jobs in the TaskManager (#1474 @tobegit3hub)
  • Other minor features (#1601 @dl239; #1574 @vagetablechicken; #1546 @keyu813; #1729 @vagetablechicken; #1460 @tobegit3hub)

Bug Fixes

  • Incorrect results when the order of conditions specified in where is different from that of the index (#1709 @aceforeverd)
  • Incorrect results of lag/at/lead under certain circumstances (#1605 #1739 @aceforeverd)
  • Memory leakage in zk_client (#1660 @wuxiaobai24)
  • No catalog update if the role of a tablet is changed (#1655 @dl239)
  • Related bugs about UnsafeRow for the offline engine (#1298, #1312, #1326, #1362, #1637, #1381, #1731 @tobegit3hub)
  • Incorrect results after adding a new index in the standalone mode (#1721 @keyu813)
  • Incorrect results of SHOW JOBS under certain circumstances (#1453 @tobegit3hub)
  • Incorrect results of the date columns with UnsafeRowOpt(#1469 @tobegit3hub)
  • Other minor bug fixes (#1698 @kfiring; #1651 @kutlayacar; #1621 @KaidoWang; #1150, #1243 @tobegit3hub; )

Code Refactoring

#1616 @dl239; #1743 @zjx1319

0.4.4 - 2022-04-01

Features

  • Support the standalone version by Java and Python SDKs (#1302 #1325 #1485 @tobegit3hub @HuilinWu2 @keyu813)
  • Support the blocking execution for offline queries (#1486 @vagetablechicken )
  • Add the getStatement API in Java SDK (#1231 @dl239 )
  • Support multiple rows insertion in the Python SDK (#1402 @hezhaozhao-git )
  • Support the JDBC connection (#1511 @vagetablechicken )

Bug Fixes

  • The error message is empty when executing show deployment in CLI fails. (#1415 @dl239 )
  • The show job and show jobs cannot display correct information. (#1440 @vagetablechicken )
  • The built-in function execution on a string filed with the length of greater than 2048 characters causes OpenMLDB crash. (#1540 @dl239 )
  • The simple expression inference fails in some cases (#1443 @jingchen2222 )
  • The PreparedStatement in Java SDK does not perform as expected. (#1511 @vagetablechicken )

Code Refactoring

#1467 @aimanfatima ; #1513 @L-Y-L ; #1503 @Stevinson ;

0.4.3 - 2022-03-15

Features

  • Add the output of the number of rows imported after successfully importing data (#1401 @Manny-op)
  • Code Refactoring (#1366 @Cupid0320; #1378 @wuteek; #1418 @prashantpaidi; #1420 @shiyoubun; #1422 @vagetablechicken)

Bug Fixes

  • Loading online data with "not null" columns in Spark fails. (#1341 @vagetablechicken)
  • max_where and min_where results are incorrect if there is no rows matched. (#1403 @aceforeverd)
  • The insert and select execution of the standalone version fails. (#1426 @dl239)
  • Other minor bug fixes (#1379 @wuteek; #1384 jasleon)

0.4.2 - 2022-03-01

Features

  • Support timestamps in long int when importing a csv file (#1237 @vagetablechicken)
  • Change the default execution mode in CLI from online to offline (#1332 @dl239)
  • Enhancements for the Python SDK:
    • Support fetchmany and fetchall in Python SDK (#1215 @HuilinWu2)
    • Support fetching logs of TaskManager jobs in Python SDK and APIs (#1214 @tobegit3hub)
    • Support fetching the schema of result sets in Python SDK (#1194 @tobegit3hub)
    • Support the SQL magic function in Jupyter Notebook when using the Python SDK. (#1164 @HuilinWu2)
  • Enhancements for the TaskManager:
    • Taskmanager can find the local batchjob jar if the path is not configured. (#1250 @tobegit3hub)
    • Support the Yarn-client mode in TaskManager (#1265 @tobegit3hub)
    • Support correctness checking for TaskManager's configuration (#1262 @tobegit3hub)
    • Support reordering for the task list (#1256 @tobegit3hub)
  • Add new UDF functions of lower and lcase (#1192 @Liu-2001)
  • Offline queries that do not execute on tables will run successfully even when the connection fails. (#1264 @tobegit3hub)

Bug Fixes

  • Offline data import fails when the timestamp value is null. (#1274 @tobegit3hub)
  • Start time of TaskManager jobs in CLI is null. (#1272 @tobegit3hub)
  • LAST JOIN may fail in the cluster version under certain circumstances. (#1226 @dl239)
  • Invalid SQL may run successfully. (#1208 @aceforeverd)

0.4.1 - 2022-02-09

Features

  • Improve CLI error messages and support the 'enable_trace' system variable (#1129 @jingchen2222)

Bug Fixes

  • CLI coredumps when it fails to connect to a nameserver. (#1166 @keyu813)
  • Java SDK has the issue of memory leaks. (#1148 @dl239)
  • The startup fails if a pid file exists. (#1108 @dl239)
  • There are incorrect values for the column with the date type when loading data into an online table. (#1103 @yabg-shuai666)
  • Offline data import for the CSV format may cause incorrect results. (#1100 @yabg-shuai666)
  • 'Offline path' cannot be displayed after importing offline data. (#1172 @vagetablechicken)

0.4.0 - 2022-01-14

Highlights

  • The SQL-centric feature is enhanced for both standalone and cluster versions. Now you can enjoy the SQL-centric development and deployment experience seamlessly. (#991,#1034,#1071,#1064,#1061,#1049,#1045,#1038,#1034,#1029,#997,#996,#968,#946,#840,#830,#814,#776,#774,#764,#747,#740,#466,#481,#1033,#1027,#966,#951,#950,#932,#853,#835,#804,#800,#596,#595,#568,#873,#1025,#1021,#1019,#994,#991,#987,#912,#896,#894,#893,#873,#778,#777,#745,#737,#701,#570,#559,#558,#553 @tobegit3hub; #1030,#965,#933,#920,#829,#783,#754,#1005,#998 @vagetablechicken)
  • The Chinese documentations are thoroughly polished and accessible at https://docs.openmldb.ai/ . This documentation repository is available at https://github.com/4paradigm/openmldb-docs-zh , and you are welcome to make contributions.
  • Experimental feature: We have introduced a monitoring module based on Prometheus + Grafana for online feature processing. (#1048 @aceforeverd)

Other Features

  • Support SQL syntax: LIKE, HAVING (#841 @aceforeverd; #927,#698 @jingchen2222)
  • Support new built-in functions: reverse (#1004 @nautaa), dayofyear (#856 @Nicholas-SR)
  • Improve the compilation and install process, and support building from sources (#999,#871,#594,#752,#793,#805,#875,#871,#999 @aceforeverd; #992 @vagetablechicken)
  • Improve the GitHub CI/CD workflow (#842,#884,#875,#919,#1056,#874 @aceforeverd)
  • Support system databases and tables (#773 @dl239)
  • Improve the function create index (#828 @dl239)
  • Improve the demo image (#1023,#690,#734,#751 @zhanghaohit)
  • Improve the Python SDK (#913,#906 @tobegit3hub;#949,#909 @HuilinWu2; #838 @dl239)
  • Simplify the concepts of execution modes (#877,#985,#892 @jingchen2222)
  • Add data import and export for the cluster version (#1078 @tobegit3hub)
  • Add new deployment command for the cluster version (#921 @dl239)
  • Support default values when creating a table (#563 @zoyopei)
  • Support string delimiters and quotes (#668 @ZackeryWang)
  • Add a new lru_cache to support upsert (#795 @vagetablechicken)
  • Support adding index with any ts_col (#828 @dl239)
  • Improve the ts packing in sql_insert_now (#944 ,#974 @keyu813)
  • Improve documentations (#952 #885 @mahengyang; #834 @Nicholas-SR; #792,#1058,#1002,#872,#836,#792 @lumianph; #844,#782 @jingchen2222; #1022,#805 @aceforeverd)
  • Other minor updates (#1073 @dl239)

Bug Fixes

#847, #831, #647, #934, #953, #1015, #982, #927, #994, #1008, #1028, #1019, #779, #855, #350, #631, #1074, #1073, #1081

@nautaa, @Nicholas-SR, @aceforeverd, @dl239, @jingchen2222, @tobegit3hub, @keyu813

0.3.0 - 2021-11-05

Highlights

We introduce a new standalone mode that can be deployed on a single node, which is suitable for small businesses and the demonstration purpose. Please read more details from here. The standalone mode is particularly enhanced for ease of use based on the following features that are supported by standalone mode only.

Other Features

  • A new CLI command to support different levels of performance sensitivity: SET performance_sensitive=true|false. When it is set to false, SQL queries can be executed without indexes. Please read here for more details about the performance sensitivity configuration 4paradigm#555
  • Supporting SQL queries over multiple databases 4paradigm#476
  • Supporting inserting multiple tuples into a table using a single SQL 4paradigm#398
  • Improvements for Java SDK: The new API getTableSchema 4paradigm#483 The new API genDDL, which is used to generate DDLs according to a given SQL script 4paradigm#588

Bugfix

  • Exceptions caused by certain physical plans with special structures when performing column resolve for logical plans. 4paradigm#437
  • Under specific circumstances, unexpected outcomes produced by SQL queries with the WHERE when certain WHERE conditions do not fit into indexes 4paradigm#599
  • The bug when enabling WindowParallelOpt and WindowSkewOptimization at the same times 4paradigm#444
  • The bug of LCA (Lowest Common Ancestor) algorithm to support WindowParallelOpt for particular SQLs 4paradigm#485
  • Workaround for the Spark bug (SPARK-36932) when the columns with the same names in LastJoin 4paradigm#484

Acknowledgement

We appreciate the contribution to this release from external contributors who are not from 4Paradigm's core OpenMLDB team, including Kanekanekane, shawn-happy, lotabout, Shouren, zoyopei, huqianshan

0.2.3 - 2021-08-31

Feature

  • Data importer support bulk load #250
  • Support parameterized query under BatchMode #262, #168
  • Support Hive metastore and Iceberg tables for offline #245, #146
  • Integrated with Trino #254
  • Support global SortBy node for offline #296

Bug Fix

  • Fix end2end offline tests for the same SQL #300
  • desc do not display the value of ttl after adding index#156

SQL Syntax

  • nvl & nvl2: #238
  • bitwise operators: &, |, ^, ~ #244
  • between predicate: #277

0.2.2 - 2021-08-08

Feature

  • Add VARCHAR Type #237

Bug Fix

  • Fix invalid back qoute identifier name #263 can't write as multiple path style (e.g a.b) now
  • InsertPreparedStatement set month by mistake when use Date type #200

Note:

OPTIONS can't write as multiple path style (e.g a.b) now

0.2.0 - 2021-07-22

Features

  • Refactor front-end using zetasql. Thus OpenMLDB can support more SQL syntaxs and provide friendly syntax error message.
  • Better code style and comment
  • Add APIServer module. User can use Rest API access OpenMLDB.#48

SQL Syntax

Changed

  • table options syntax: #103
  • lead method: #136

Removed

  • || and && as logical operator: #264
  • at function: #265

Note

  • openmldb-0.2.0-linux.tar.gz targets on x86_64
  • aarch64 artifacts consider experimental