<thead id="b45a5"><del id="b45a5"><progress id="b45a5"></progress></del></thead>

<samp id="b45a5"><del id="b45a5"></del></samp>

<style id="b45a5"><progress id="b45a5"></progress></style>

溫馨提示×

溫馨提示×

您好，登錄后才能下訂單哦！

密碼登錄×

忘記密碼？

登錄注冊×

獲取短信驗證碼

其他方式登錄

點擊登錄注冊即表示同意《億速云用戶服務(wù)條款》

用戶登錄×

賬戶密碼登錄

請使用微信掃描上方二維碼

使用幫助

請求超時！

請點擊重新獲取二維碼

Java 嵌入 SPL 輕松實現(xiàn)數(shù)據(jù)分組

發(fā)布時間：2020-07-08 02:03:05 來源：網(wǎng)絡(luò) 閱讀：398 作者：raqsoft 欄目：大數(shù)據(jù)

要在 Java 代碼中實現(xiàn)類似 SQL 中的 GroupBy 分組聚合運算，是比較繁瑣的，通常先要聲明數(shù)據(jù)結(jié)構(gòu)（Java 實體類），然后用 Java 集合進(jìn)行循環(huán)遍歷，最后根據(jù)分組條件添加到某個子集合中。Java 8 有了 Lambda（stream）代碼簡潔了許多，分組后往往還要跟著聚合操作，仍然需要單寫聚合函數(shù) sum(),count(*),topN()等。這些還都是最常規(guī)的分組和聚合運算，遇到對位分組、枚舉分組、多重分組等非常規(guī)分組加上其他聚集函數(shù) (FIRST，LAST…)，代碼就變得非常冗長且不通用。如果能有一個中間件專門負(fù)責(zé)這類計算，采用類似 SQL 腳本做算法描述，在 Java 中直接調(diào)用腳本并返回結(jié)果集就好了。Java 版集算器和 SPL 腳本，就是這樣的機制，下面舉例說明如何使用。

SPL 實現(xiàn)

常規(guī)分組

duty.xlsx 文件中保存著每個人的加班記錄:

workday	name
2016-02-05	Ashley
2016-02-08	Ashley
2016-02-09	Ashley
2016-02-10	Johnson
2016-02-11	Johnson
2016-02-12	Johnson
2016-02-15	Ashley
2016-02-16	Ashley
2016-02-17	Ashley

匯總每個人的值班天數(shù)：

	A
1	=file("/Users/test/duty.xlsx").importxls@tx()
2	=A1.groups(name;count(name):count)

Java 嵌入 SPL 輕松實現(xiàn)數(shù)據(jù)分組

保存腳本文件CountName.dfx(嵌入 Java 會用到)

每組 TopN

取每個月、每個人、頭三天的加班記錄

	A
1	=file("/Users/test/duty.xlsx").importxls@tx()
2	=A1.group(month(workday):mon,name;~.top(3):top3)

Java 嵌入 SPL 輕松實現(xiàn)數(shù)據(jù)分組

保存腳本文件RecMonTop3.dfx(嵌入 Java 會用到)

Java 調(diào)用

SPL 嵌入到 Java 應(yīng)用程序十分方便，通過 JDBC 調(diào)用存儲過程方法加載，用常規(guī)分組保存的文件CountName.dfx，示例調(diào)用如下：

...  Connection con = null;  Class.forName("com.esproc.jdbc.InternalDriver");  con= DriverManager.getConnection("jdbc:esproc:local://"); //調(diào)用存儲過程，其中CountName是dfx的文件名  st =(com. esproc.jdbc.InternalCStatement)con.prepareCall("call CountName()");  //執(zhí)行存儲過程  st.execute();  //獲取結(jié)果集  ResultSet rs = st.getResultSet(); ......  Connection con = null;  Class.forName("com.esproc.jdbc.InternalDriver");  con= DriverManager.getConnection("jdbc:esproc:local://"); //調(diào)用存儲過程，其中CountName是dfx的文件名  st =(com. esproc.jdbc.InternalCStatement)con.prepareCall("call CountName()");  //執(zhí)行存儲過程  st.execute();  //獲取結(jié)果集  ResultSet rs = st.getResultSet(); ...

替換成 RecMonTop3.dfx 是同樣的道理，只需 call RecMonTop3() 即可，也可同時返回兩個結(jié)果集。這里只用 Java 片段粗略解釋了如何嵌入 SPL，詳細(xì)步驟請參閱 Java 如何調(diào)用 SPL 腳本，也非常簡單，不再贅述。同時，SPL 也支持 ODBC 驅(qū)動，集成到支持 ODBC 的語言，嵌入過程類似。

拓展節(jié)選

之前沒有相關(guān)的總結(jié)，其實關(guān)于數(shù)據(jù)分組，細(xì)分起來其實還有很多種，對位分組、枚舉分組、多重分組…，在乾學(xué)院 SPL 官方論壇都有總結(jié)和示例，這里節(jié)選其中兩種。

SPL 對位分組

示例 1：按順序分別列出使用 Chinese、English、French 作為官方語言的國家數(shù)量

MySQL8: with t(name,ord) as (select 'Chinese',1 union all select 'English',2 union all select 'French',3) select t.name, count(countrycode) cnt from t left join world.countrylanguage s on t.name=s.language where s.isofficial='T' group by name,ord order by ord;MySQL8: with t(name,ord) as (select 'Chinese',1 union all select 'English',2 union all select 'French',3) select t.name, count(countrycode) cnt from t left join world.countrylanguage s on t.name=s.language where s.isofficial='T' group by name,ord order by ord;

注意：表的字符集和數(shù)據(jù)庫會話的字符集要保持一致。

(1) show variables like ’character_set_connection’查看當(dāng)前會話字符集

(2) show create table world.countrylanguage 查看表的字符集

(3) set character_set_connection=[字符集] 更新當(dāng)前會話字符集

集算器 SPL:

	A
1	=connect("mysql")
2	=A1.query@x("select * from world.countrylanguage where isofficial='T'")
3	[Chinese,English,French]
4	=A2.align@a(A3,Language)
5	=A4.new(A3(#):name, ~.len():cnt)

A1: 連接數(shù)據(jù)庫

A2: 查詢出所有官方語言的記錄

A3: 需要列出的語言

A4: 將所有記錄按 Language 對位到 A3 相應(yīng)位置

A5: 構(gòu)造以語言和使用此語言為官方語言的國家數(shù)量的序表

Java 嵌入 SPL 輕松實現(xiàn)數(shù)據(jù)分組

示例 2：按順序分別列出使用 Chinese、English、French 及其它語言作為官方語言的國家數(shù)量

MySQL8: with t(name,ord) as (select 'Chinese',1 union all select 'English',2 union all select 'French',3 union all select 'Other', 4), s(name, cnt) as ( select language, count(countrycode) cnt from world.countrylanguage s where s.isofficial='T' and language in ('Chinese','English','French') group by language union all select 'Other', count(distinct countrycode) cnt from world.countrylanguage s where isofficial='T' and language not in ('Chinese','English','French') ) select t.name, s.cnt from t left join s using (name) order by t.ord;MySQL8: with t(name,ord) as (select 'Chinese',1 union all select 'English',2 union all select 'French',3 union all select 'Other', 4), s(name, cnt) as ( select language, count(countrycode) cnt from world.countrylanguage s where s.isofficial='T' and language in ('Chinese','English','French') group by language union all select 'Other', count(distinct countrycode) cnt from world.countrylanguage s where isofficial='T' and language not in ('Chinese','English','French') ) select t.name, s.cnt from t left join s using (name) order by t.ord;

集算器 SPL:

	A
1	=connect("mysql")
2	=A1.query@x("select * from world.countrylanguage where isofficial='T'")
3	[Chinese,English,French,Other]
4	=A2.align@an(A3.to(3),Language)
5	=A4.new(A3(#):name, if(#<=3,~.len(), ~.icount(CountryCode)):cnt)

A4: 將所有記錄按 Language 對位到 A3.to(3) 相應(yīng)位置，并追加一組用于存放不能對位的記錄

A5: 第 4 組計算不同 CountryCode 的數(shù)量

Java 嵌入 SPL 輕松實現(xiàn)數(shù)據(jù)分組

SPL 枚舉分組

示例 1：按順序列出各類型城市的數(shù)量

MySQL8: with t as (select * from world.city where CountryCode='CHN'), segment(class,start,end) as (select 'tiny', 0, 200000 union all select 'small',  200000, 1000000 union all select 'medium', 1000000, 2000000 union all select 'big', 2000000, 100000000 ) select class, count(1) cnt from segment s join t on t.population>=s.start and t.population<s.end group by class, start order by start;MySQL8: with t as (select * from world.city where CountryCode='CHN'), segment(class,start,end) as (select 'tiny', 0, 200000 union all select 'small',  200000, 1000000 union all select 'medium', 1000000, 2000000 union all select 'big', 2000000, 100000000 ) select class, count(1) cnt from segment s join t on t.population>=s.start and t.population<s.end group by class, start order by start;

集算器 SPL:

	A
1	=connect("mysql")
2	=A1.query@x("select * from world.city where CountryCode='CHN'")
3	=${string([20,100,200,10000].(~*10000).("?<"/~))}
4	[tiny,small,medium,big]
5	=A2.enum(A3,Population)
6	=A5.new(A4(#):class, ~.len():cnt)

A3: ${…} 宏替換，以大括號內(nèi)表達(dá)式的結(jié)果作為新表達(dá)式進(jìn)行計算，結(jié)果為序列 [“?<200000”,“?<1000000”,“?<2000000”,“?<100000000”]

A5: 針對 A2 中每條記錄，尋找 A3 中第 1 個成立的條件，并追加到對應(yīng)的組中

Java 嵌入 SPL 輕松實現(xiàn)數(shù)據(jù)分組

示例 2：列出華東地區(qū)大型城市數(shù)量、其它地區(qū)大型城市數(shù)量、非大型城市數(shù)量

MySQL8: with t as (select * from world.city where CountryCode='CHN') select 'East&Big' class, count(*) cnt from t where population>=2000000 and district in ('Shanghai','Jiangshu', 'Shandong','Zhejiang','Anhui','Jiangxi') union all select 'Other&Big', count(*) from t where population>=2000000 and district not in ('Shanghai','Jiangshu','Shandong','Zhejiang','Anhui','Jiangxi') union all select 'Not Big', count(*) from t where population<2000000;MySQL8: with t as (select * from world.city where CountryCode='CHN') select 'East&Big' class, count(*) cnt from t where population>=2000000 and district in ('Shanghai','Jiangshu', 'Shandong','Zhejiang','Anhui','Jiangxi') union all select 'Other&Big', count(*) from t where population>=2000000 and district not in ('Shanghai','Jiangshu','Shandong','Zhejiang','Anhui','Jiangxi') union all select 'Not Big', count(*) from t where population<2000000;

集算器 SPL:

	A
1	=connect("mysql")
2	=A1.query@x("select * from world.city where CountryCode='CHN'")
3	[Shanghai,Jiangshu, Shandong,Zhejiang,Anhui,Jiangxi]
4	[?(1)>=2000000 && A3.contain(?(2)), ?(1)>=2000000 && !A3.contain(?(2))]
5	[East&Big,Other&Big, Not Big]
6	=A2.enum@n(A4, [Population,District])
7	=A6.new(A5(#):class, A6(#).len():cnt)

A5: enum@n 將不滿足 A4 中所有條件的記錄存放到追加的最后一組中

Java 嵌入 SPL 輕松實現(xiàn)數(shù)據(jù)分組

示例 3：列出所有地區(qū)大型城市數(shù)量、華東地區(qū)大型城市數(shù)量、非大型城市數(shù)量

MySQL8: with t as (select * from world.city where CountryCode='CHN') select 'Big' class, count(*) cnt from t where population>=2000000 union all select 'East&Big' class, count(*) cnt from t where population>=2000000 and district in ('Shanghai','Jiangshu','Shandong','Zhejiang','Anhui','Jiangxi') union all select 'Not Big' class, count(*) cnt from t where population<2000000;MySQL8: with t as (select * from world.city where CountryCode='CHN') select 'Big' class, count(*) cnt from t where population>=2000000 union all select 'East&Big' class, count(*) cnt from t where population>=2000000 and district in ('Shanghai','Jiangshu','Shandong','Zhejiang','Anhui','Jiangxi') union all select 'Not Big' class, count(*) cnt from t where population<2000000;

集算器 SPL:

	A
1	=connect("mysql")
2	=A1.query@x("select * from world.city where CountryCode='CHN'")
3	[Shanghai,Jiangshu, Shandong,Zhejiang,Anhui,Jiangxi]
4	[?(1)>=2000000, ?(1)>=2000000 && A3.contain(?(2))]
5	[Big, East&Big, Not Big]
6	=A2.enum@rn(A4, [Population,District])
7	=A6.new(A5(#):class, A6(#).len():cnt)

A6: 若 A2 中記錄滿足 A4 中多個條件時，enum@r 會將其追加到對應(yīng)的每個組中

Java 嵌入 SPL 輕松實現(xiàn)數(shù)據(jù)分組

優(yōu)勢總結(jié)

有庫寫 SQL，沒庫寫 SPL
用 Java 程序直接匯總計算數(shù)據(jù)，還是比較累的，代碼很長，并且不可復(fù)用，很多情況數(shù)據(jù)也不在數(shù)據(jù)庫里，有了 SPL，就能像在 Java 中用 SQL 一樣了，十分方便。
常用無憂，不花錢就能取得終身使用權(quán)的入門版
如果要分析的數(shù)據(jù)是一次性或臨時性的，潤乾集算器每個月都提供免費試用授權(quán)，可以循環(huán)免費使用。但要和 Java 應(yīng)用程序集成起來部署到服務(wù)器上長期使用，定期更換試用授權(quán)還是比較麻煩，潤乾提供了有終身使用權(quán)的入門版，解決了這個后顧之憂，獲得方式參考如何免費使用潤乾集算器？
技術(shù)文檔和社區(qū)支持
官方提供的集算器技術(shù)文檔本身就有很多現(xiàn)成的例子，常規(guī)問題從文檔里都能找到解決方法。如果獲得了入門版，不僅能夠使用 SPL 的常規(guī)功能，碰到任何問題都可以去乾學(xué)院上去咨詢，官方通過該社區(qū)對入門版用戶提供免費的技術(shù)支持。

向AI問一下細(xì)節(jié)

推薦閱讀：

免責(zé)聲明：本站發(fā)布的內(nèi)容（圖片、視頻和文字）以原創(chuàng)、轉(zhuǎn)載和分享為主，文章觀點不代表本網(wǎng)站立場，如果涉及侵權(quán)請聯(lián)系站長郵箱：is@yisu.com進(jìn)行舉報，并提供相關(guān)證據(jù)，一經(jīng)查實，將立刻刪除涉嫌侵權(quán)內(nèi)容。

上一篇新聞：
python字符編碼
下一篇新聞：
表單提交前顯示確認(rèn)窗口

猜你喜歡

AI
助
手

產(chǎn)品服務(wù)

地區(qū)劃分

專題活動

幫助支持

關(guān)于我們

售后咨詢

7*24小時在線電話：400-100-2938

7*24小時在線 QQ：800811969

關(guān)注億速云

億速云公眾號

手機網(wǎng)站二維碼

<samp id="ykx3j"><del id="ykx3j"></del></samp>

<label id="ykx3j"><nav id="ykx3j"></nav></label>

<table id="ykx3j"></table>

<samp id="ykx3j"><del id="ykx3j"></del></samp>