athyg-database's Introduction

Augmented Tycho - HYG (AT-HYG)

Rationale

HIPPARCOS (and thus HYG) is getting a little dated, while Gaia (its ultimate replacement for many purposes) is absolutely gigantic -- 1.3 billion records -- and too large for many general-purpose applications. For this update, I wanted to accomplish the following:

  1. Incorporate Gaia results into the old but popular HYG dataset.
  2. Use a well-defined modern star catalog as a primary basis for IDs, rather than the somewhat ad hoc system in HYG.
  3. Create a larger database than HYG that is suitable for today's faster systems, but still small enough to be manageable in many applications.

I chose the Tycho-2 catalog for this purpose. The Tycho-2 catalog's size (~2.5M records) and coverage (essentially complete to V = 11, mostly complete to V = 11.5) make it suitable for many current applications. It's large enough to be comprehensive for all but the deepest charts, but small enough to be manageable in fairly simple applications.

Tycho-2 (released in 2000) is only slightly younger than HIPPARCOS, so the idea was to combine it with the newer Gaia DR3 results (2020-2022) whenever feasible. This gives high-quality distance information for over 2M stars instead of the lower-precision results from HIPPARCOS for 0.1M. Adding Gaia results to Tycho-2 gives an "Augmented" Tycho-2 dataset (AT), which is the basis for most stars in the new catalog.

By merging AT with HYG, the "classical" IDs and names for a large number of Tycho / Gaia stars are also available. The result is AT-HYG.

License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

This license applies to versions v3.0 and later. Older versions (prior to v3.0) are licensed under CC BY-SA 2.5.

Download Format

Full Catalog (version 3.1 - data/athyg_v31-*.csv.gz)

The full catalog, even when compressed, is too large for simple hosting in this repository. It is currently split into 2 components, which should be downloaded, uncompressed, and concatenated, e.g. (Linux command for version 3.1):

cat athyg_v31-1.csv athyg_v31-2.csv > athyg_v31.csv
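On systems without `cat`, the decompress-and-concatenate step can be approximated in Python. This is a minimal sketch, not part of the repository: the helper name `concat_gzip_parts` is hypothetical, and it assumes the parts are plain gzip files that only need to be joined byte for byte in order, as the `cat` command above implies.

```python
import gzip
import shutil
from pathlib import Path


def concat_gzip_parts(part_paths, out_path):
    """Decompress each gzip part in the given order and append it to out_path."""
    with open(out_path, "wb") as out:
        for part in part_paths:
            # gzip.open in binary mode yields the decompressed bytes.
            with gzip.open(part, "rb") as src:
                shutil.copyfileobj(src, out)
    return Path(out_path)
```

For version 3.1 the call would presumably be `concat_gzip_parts(["athyg_v31-1.csv.gz", "athyg_v31-2.csv.gz"], "athyg_v31.csv")`, with the file names inferred from the `data/athyg_v31-*.csv.gz` pattern.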

The full CSV can then be imported into the database tool of your choosing.
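As one possible import route, the sketch below loads the concatenated CSV into SQLite using only the Python standard library. It is an illustration under assumptions: the function name `load_csv_into_sqlite` and the table name `athyg` are hypothetical, the first CSV line is assumed to be a header row, and every column is stored as text because the catalog's schema is not described here.

```python
import csv
import sqlite3


def load_csv_into_sqlite(csv_path, db_path, table="athyg"):
    """Create `table` from the CSV header and insert every row as text.

    Returns the number of rows inserted.
    """
    with open(csv_path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader)
        # Quote column names so any header value is a valid SQL identifier.
        cols = ", ".join('"{}" TEXT'.format(c.replace('"', '""')) for c in header)
        placeholders = ", ".join("?" for _ in header)
        con = sqlite3.connect(db_path)
        try:
            con.execute('CREATE TABLE "{}" ({})'.format(table, cols))
            con.executemany(
                'INSERT INTO "{}" VALUES ({})'.format(table, placeholders),
                reader,
            )
            con.commit()
            count = con.execute(
                'SELECT COUNT(*) FROM "{}"'.format(table)
            ).fetchone()[0]
        finally:
            con.close()
    return count
```

Storing everything as text keeps the sketch schema-agnostic; a real import would likely declare numeric types for positions, magnitudes, and velocities.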

Subset files (version 3.1 - data/subsets)

Several smaller subset files are available if the full catalog is still too large. One of these, the "HYGLike" subset, is designed to have a similar schema to the HYG Database and be a drop-in replacement for it in many applications. The other subset files are simple extracts from the main AT-HYG that have a lower (brighter) magnitude cutoff for most stars.

See data/subsets/README.md for more details about these files.

Current Version:

The current version of AT-HYG is v3.1 (data/athyg_v31-*.csv.gz).

Changes from version 3.0:

Incorporate changes from HYG v4.0 to v4.1

The update for v3.1 adds 11 new proper names for stars that correspond directly to 11 new proper names in HYG v4.1, specifically to components of double or multiple stars where a specific proper name for the "B" component was desired. No other data has been changed.

Comparison to HYG

The HYG catalog lists proper motions and radial velocities in the same units as AT-HYG (milliarcsec/year and km/sec respectively), but gives the Cartesian velocities vx, vy, and vz in parsecs per year. AT-HYG uses kilometers per second instead, in part to reduce the space needed for figures in CSV format (the figures in pc/year have 5 or 6 leading zeroes) and in part to make direct comparisons to the radial velocity simple. To convert between the two, 1 km/sec = 1.02269E-6 pc/year.
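For code that was written against HYG and expects velocities in pc/year, the conversion is a single multiplication. The sketch below uses the factor quoted above; the function and constant names are illustrative and do not come from either catalog.

```python
# Conversion factor quoted in this README: 1 km/sec = 1.02269E-6 pc/year.
KMS_TO_PC_PER_YEAR = 1.02269e-6


def kms_to_pc_per_year(v_kms):
    """Convert a velocity component from km/sec (AT-HYG) to pc/year (HYG)."""
    return v_kms * KMS_TO_PC_PER_YEAR


def pc_per_year_to_kms(v_pc_yr):
    """Convert a velocity component from pc/year (HYG) to km/sec (AT-HYG)."""
    return v_pc_yr / KMS_TO_PC_PER_YEAR
```

The same factor applies to each of vx, vy, and vz independently, so a whole velocity vector can be converted component by component.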

Apart from the difference in units for vx, vy, and vz, and a few slightly different field names, AT-HYG 3.1 is almost a drop-in replacement for HYG 3.x or HYG 4.x for many applications -- in particular, for any application that needs only the stars' catalog IDs, historical IDs (like Henry Draper or traditional names), magnitudes, positions, and velocities. The fields missing from AT-HYG that are present in HYG are:

  • Variable star IDs and approximate magnitude ranges
  • Explicit identification of components in multiple stars (in terms of a "base ID" representing the primary and a component ID)

Other Versions:

See version-info.md for more details on previous versions of the catalog.

athyg-database's People

Contributors

astronexus

athyg-database's Issues

Protobuf Support

Hi,

I was wondering if there were any plans to export a version of the datasets in a binary format such as Google's protocol buffers?

It would greatly improve the efficiency of data processing and storage, and offer excellent interoperability across different systems and languages.
