Giter Site home page Giter Site logo

climbing-guidebooks's Introduction

Climbing Guidebooks Dataset

This repo provides a CSV dataset about guidebooks published about climbing areas in the United States of America (USA).

How was the data collected?

An initial collection of all guidebooks from three major retailers:

  • Rock and Snow
  • REI
  • Amazon

The Amazon guidebooks were gathered from a search for "guide" in the category Books : Sports & Outdoors : Mountaineering : Rock Climbing

Only climbing guides have been collected. This includes rock climbing, bouldering, ice climbing, and the occasional mountaineering guide if it mentions climbing. Hiking guides and mountaineering guides that do not mention climbing have been excluded.

The initial dataset was then enhanced with data from other retailers and publishers of climbing guidebooks:

Some rabbit holes were followed when searching for data about the books on the initial list, leading to various other sources such as self published books and posts on Mountain Project about guidebooks:

About the dataset

This dataset is provided as a comma-separated values (CSV) file with the following columns:

column name format details
title string title of the guidebook
year string year the guidebook was published
authors multi-value comma-separated strings author(s) of the guidebook
note string miscellaneous information about the guidebook, such as whether it's out of print or digital only
edition integer number of the book edition. For example, 1 for the first edition of a book.
editors multi-value comma-separated strings editor(s) of the guidebook, if known
publisher string publisher of the guidebook
ISBN string ISBN-13 of the guidebook, occasionally ISBN-10.

Notes about missing data

Given this initially standard and then chaotically deviating methodology, there are certainly gaps in this dataset in terms of:

  • Edition coverage. Not all book editions are captured.
  • ISBNs. Not all ISBNs could be identified, and not all books have ISBNs, especially self-published books. Even when the ISBN exists, it might not be clearly documented online.
  • Years. Not all publication years could be identified.
  • Publishers. Not all book publishers could be identified.
  • Editors. Data on editors was collected only when easily available and known.

Because this research was conducted using what is available from online retailers, missing information that might be available to book owners was often unable to be verified (the data collectors do own some guidebooks!) .

If you want to help, see How to contribute to this dataset.

Notes on standardization of the data

In some cases, the data in the dataset might not match the ISBN record data or information for a specific book or edition. Some decisions were made to standardize the data to make comparisons easier to perform. For example:

  • Author name spelling. For authors with multiple books and multiple name spellings, the most common name spelling was chosen to represent the author in the dataset. For example, Peter Hubbell (sometimes Hubbel).
  • Self published books were noted as "Self Published", rather than listing the author name.

Who collected the data?

  • Kimbrough Moore, climbing guidebook author and philosophy professor, collected the initial dataset.
  • Sarah Moir, technical writer and data analysis dabbler, expanded and standardized the dataset.

Why was the data collected?

To gather a better understanding of who is writing climbing guidebooks.

How to contribute to this dataset

There are a lot of gaps in this dataset, and we welcome contributions of:

  • Guidebooks missing from this dataset. Please include as many column values as possible.
  • Data missing for guidebooks in this dataset, such as year, publisher, or editor.

How you contribute depends on how comfortable you are using GitHub:

  • If you're comfortable with GitHub, open a pull request to add or update the data in the CSV file.
  • If you have a GitHub account but don't want to open a pull request, open a GitHub issue with as much detail as possible about the data that you want added.

To ensure data integrity, please include verifying information such as an ISBN number, or a photo of a publishing page for the guidebook.

How to cite this dataset

Authors: Kimbrough Moore and Sarah Moir

Title: Climbing Guidebook for USA Dataset

Year Created: 2023

This dataset uses the MIT license.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.