Comments (6)
this is the relational algebra it generates. I am not sure what the semi join would generate but I am guessing something very similar.
LogicalJoin(condition=[=($0, $4)], joinType=[inner])
LogicalTableScan(table=[[main, nation]])
LogicalAggregate(group=[{0}])
BindableTableScan(table=[[main, nation]], filters=[[=($0, 16)]], projects=[[0]], aliases=[[n_nationkey]])
from blazingsql.
In the future we will be able to optimize out LogicalAggregate(group=[{0}])
using the CBO when the column in question is unique but that is currently not supported.
from blazingsql.
I just tried the following and it worked for me:
import cudf
bc = BlazingContext()
bc.s3('bsql_data', bucket_name='blazingsql-colab', access_key_id='AKIAJGB3SR3IXU3TE5WA', secret_key='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')
bc.create_table('nation', 's3://bsql_data/tpch_sf1/nation/0_0_0.parquet')
result = bc.sql('select * from nation where nation.n_nationkey in ( select other.n_nationkey from nation as other where n_nationkey = 16)').get()
print(result.columns)
The output i got was
0 16 MOZAMBIQUE 0
n_comment
0 s. ironic, unusual asymptotes wake blithely r ```
from blazingsql.
Can you show me a complete example where this is not working how you would expect?
from blazingsql.
Sorry for being unclear ,
I would like left-join
to work natively i.e , i would like below sql query to work.
SELECT e.EmpName, e.DepID
FROM @employees AS e
LEFT SEMIJOIN (SELECT (int?) DepID AS DepID, DepName FROM @departments) AS d
ON e.DepID == d.DepID;
Question
Will there be performance implications for using SELECT * FROM A WHERE A.key IN (SELECT B.key FROM B) pattern
instead of left-semi join
or do you expect the performance to remain the same ?
from blazingsql.
this is the relational algebra it generates. I am not sure what the semi join would generate but I am guessing something very similar.
LogicalJoin(condition=[=($0, $4)], joinType=[inner]) LogicalTableScan(table=[[main, nation]]) LogicalAggregate(group=[{0}]) BindableTableScan(table=[[main, nation]], filters=[[=($0, 16)]], projects=[[0]], aliases=[[n_nationkey]])
Thanks for looking into this, will update on this issue if using this pattern is not as performent as we would like on my use-case .
from blazingsql.
Related Issues (20)
- [BUG] Error when reading table with hive cursor that does not happen with hdfs
- [BUG] Logs being admitted despite settings causing HA failures HOT 3
- OK
- [BUG] e2e crash running concatSuite
- [BUG] Enables/fix HiveFileTest TEST_01
- [BUG]SHA hashing function not working in Blazing SQL HOT 3
- [BUG] Add temporary DECIMAL support by using float64 instead
- [BUG] blazingsql 21.08 not available on conda, while rest of rapids is released as planned
- [BUG] blazingsql take a lot of disk storage
- Bug when join 2 table with [BUG]
- Add support for H3
- [BUG] Very verbose bsql connection
- [BUG] byte_range offset with header not supported
- [BUG] [SECURITY] Log4j CVE Metaissue HOT 3
- FileNotFoundError: [Errno 2] No such file or directory: 'blazingsql-orchestrator': 'blazingsql-orchestrator'
- [BUG] app.blazing.com website not reachable HOT 1
- [BUG] Cannot import BlazingContext when processor type unknown HOT 2
- http://app.blazingsql.com/ does not work
- Does blazingsql support GPU passthrough?
- [QST] Is blazingSQL still active? HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from blazingsql.