join on different data type

Published: 2020-06-01 21:06:00 | Source: web | Views: 836 | Author: r7raul | Category: Big Data

SELECT ....
FROM A LEFT SEMI JOIN B
ON (A.col1 = B.col2)
WHERE ...
If A.col1 is of DOUBLE type but B.col2 is of BIGINT, the query prints "WARNING: Comparing a bigint and a double may result in a loss of precision." Why can't col2 be cast to double automatically?


> If A.col1 is of DOUBLE type, but B.col2 is of BIGINT,

The automatic conversion is not acceptable: according to the Java language spec (section 5.1.2), converting a long to a double may result in a loss of precision.

https://docs.oracle.com/javase/specs/jls/se7/html/jls-5.html#jls-5.1.2
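
To make the precision loss concrete, here is a minimal Python sketch. It assumes a Python float (an IEEE 754 double) stands in for DOUBLE and a plain int stands in for BIGINT; the variable names are illustrative only and not part of the original query.

```python
# A double has a 53-bit significand, so not every 64-bit integer has an
# exact double representation.
big_a = 2**53        # exactly representable as a double
big_b = 2**53 + 1    # not representable; rounds to a neighbouring double

as_double_a = float(big_a)
as_double_b = float(big_b)

# Two distinct integer keys collapse onto the same double value.
print(big_a == big_b)               # False
print(as_double_a == as_double_b)   # True
```

So if both sides were silently promoted to double, two different BIGINT keys could compare equal.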

 

 

Also note that, in general, even if you cast, you might be casting the wrong way around.

Joins on double columns will give incorrect (or rather unintended, but IEEE 754 correct) results when comparing byte-serialized representations, because two values that are nearly equal (within epsilon of each other) still serialize to different bytes.

The easiest way to demonstrate this is to try the simplest off-by-epsilon case (say, in Python):

>>> import sys
>>> 0.1 + 0.2
0.30000000000000004
>>> 0.1 + 0.2 > 0.3
True
>>> ((0.1+0.2) - 0.3) < sys.float_info.epsilon
True
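
The same case can be looked at from the byte-serialized side. The following is only a sketch: it assumes big-endian IEEE 754 encoding via Python's `struct` module as a stand-in for whatever serialization the join actually uses, and `double_bytes` is a hypothetical helper.

```python
import struct

def double_bytes(x: float) -> bytes:
    """Serialize a float as 8 big-endian IEEE 754 bytes."""
    return struct.pack(">d", x)

summed = 0.1 + 0.2       # value produced by arithmetic
parsed = float("0.3")    # value produced by parsing text

print(double_bytes(summed).hex())
print(double_bytes(parsed).hex())

# The values differ by less than epsilon, yet the byte strings are not equal.
print(double_bytes(summed) == double_bytes(parsed))  # False
```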

 

 

So if the RHS values were produced in ETL by sum() and the LHS values were produced by parsing log text, the JOIN will output zero rows, even for values that look equal.
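
A rough Python sketch of that situation, using made-up rows (`lhs_rows`, `rhs_rows` and their labels are hypothetical) and a set lookup to mimic a LEFT SEMI JOIN on the amount column:

```python
# Hypothetical data: the left side is parsed from text, the right side is
# the result of summing smaller amounts.
lhs_rows = [("order-1", float("0.3"))]
rhs_rows = [("ledger-1", sum([0.1, 0.2]))]

# Keys available on the right-hand side of the join.
rhs_keys = {amount for _, amount in rhs_rows}

# LEFT SEMI JOIN-style filter: keep left rows whose key exists on the right.
joined = [row for row in lhs_rows if row[1] in rhs_keys]

print(joined)  # []
```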

 

If you want to do equijoins like that, the only valid approach is to cast both sides to fixed-precision bigints (say, convert all dollars to cents by multiplying by 100), not to cast both to double.
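
As a sketch of the dollars-to-cents idea, again with made-up rows: `to_cents` is a hypothetical helper, and the use of rounding rather than a truncating cast is an assumption that goes beyond the original advice.

```python
def to_cents(dollars: float) -> int:
    # Assumption: round to the nearest cent. A truncating cast could turn a
    # value sitting just below a whole cent into the wrong integer.
    return round(dollars * 100)

# Hypothetical data, same shape as a parsed-vs-summed join.
lhs_rows = [("order-1", float("0.3"))]
rhs_rows = [("ledger-1", sum([0.1, 0.2]))]

# Join on integer cents instead of on the raw doubles.
rhs_keys = {to_cents(amount) for _, amount in rhs_rows}
joined = [row for row in lhs_rows if to_cents(row[1]) in rhs_keys]

print(joined)  # [('order-1', 0.3)]
```

With both sides expressed as integer cents, the off-by-epsilon difference disappears before the comparison happens.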

 

Cheers,
Gopal


