Giter Site home page Giter Site logo

Querying Parquet files about go-duckdb HOT 2 CLOSED

marcboeker avatar marcboeker commented on July 24, 2024
Querying Parquet files

from go-duckdb.

Comments (2)

marcboeker avatar marcboeker commented on July 24, 2024

Have you tried running INSTALL parquet; LOAD parquet beforehand like in https://github.com/marcboeker/go-duckdb/blob/master/duckdb_test.go#L910-L912?

from go-duckdb.

peterinfra avatar peterinfra commented on July 24, 2024

Thanks for the pointer. I am able to follow along the example from the JSON extension and use it for parquet.

import (
	"context"
	"database/sql"
	"database/sql/driver"
	"fmt"
	"log"

	"github.com/marcboeker/go-duckdb"
)

var db *sql.DB

func check(args ...interface{}) {
	err := args[len(args)-1]
	if err != nil {
		panic(err)
	}
}

func loadParquet(execer driver.ExecerContext) error {
	bootQueries := []string{
		"INSTALL 'parquet'",
		"LOAD 'parquet'",
	}

	for _, qry := range bootQueries {
		_, err := execer.ExecContext(context.TODO(), qry, nil)
		if err != nil {
			return err
		}
	}
	return nil
}

type data struct {
	c int64
}

func main() {
	connector, err := duckdb.NewConnector("", loadParquet)
	if err != nil {
		fmt.Println("Failed to load Parquet")
		log.Fatal(err)
	}

	db := sql.OpenDB(connector)
	if err != nil {
		log.Fatal(err)
	}
	defer db.Close()
	check(db.Ping())
	rows, err := db.Query(`select count(*) as c from read_parquet('my.parquet');`)
	check(err)
	defer rows.Close()
	for rows.Next() {
		c := new(data)
		err := rows.Scan(&c.c)
		if err != nil {
			log.Fatal(err)
		}
		log.Printf("Count = %d", c.c)
		fmt.Println(c.c)
	}

}

from go-duckdb.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.