Giter Site home page Giter Site logo

tomltable's Introduction

tomltable: generate LaTeX tables from TOML specifications and JSON data

This Python package provides a command-line utility called tomltable. This utility generates a LaTeX table from a TOML formatted table specification (read from stdin) and a set of JSON files (specified as arguments).

tomltable can accommodate JSON files regardless of how they are structured. However, its specification language also provides a convenience shortcut for regression tables. There are two related packages that save regression results in a structure that is compatible with tomltable:

  1. jsonwriter for R and
  2. json_this for Stata.

The spiritual ancestor of this package is coeftable which performed a similar task but without the TOML-based specification language.

Installation

tomltable is available through PyPI so you can install it with pip:

$ python3 -m pip install tomltable

You can also install it directly from the Codeberg Git repository by typing either

$ python3 -m pip install --user git+https://codeberg.org/gnyeki/tomltable

or

$ git clone https://codeberg.org/gnyeki/tomltable
$ python3 -m pip install --user ./tomltable

Usage

Generating a regression table

We will generate the following table:

Preview of example_mag.tex

To start with, it is convenient to save regression results with jsonwriter or json_this. An example that uses the former in R:

# example_mag.R

install.packages("devtools")
devtools::install_git(url = "https://codeberg.org/gnyeki/jsonwriter")

library(fixest)
library(jsonwriter)

data(quakes)

# Regression on all earthquakes.
#
write_json(
    feols(depth ~ mag, quakes),
    "example_model_1.json")

# Regression on earthquakes that were reported by a below-median number
# of stations.
#
write_json(
    feols(depth ~ mag, quakes[quakes$stations <= 27, ]),
    "example_model_2.json")

# Regression on earthquakes that were reported by an above-median number
# of stations.
#
write_json(
    feols(depth ~ mag, quakes[quakes$stations > 27, ]),
    "example_model_3.json")

Let's specify a table with three columns, one for each JSON file that is generated by the R script above. We add column numbers and the name of the outcome variable to the header, and the R-squared along with the number of observations to the footer:

# example_mag.toml

[header]
add-column-numbers = true

[[header.row]]
cell = ["Depth", "Depth", "Depth"]

[[body.cell]]
label = "Magnitude"
coef = "mag"

[[footer.cell]]
label = "Observations"
cell = "%(n::nobs)d"

[[footer.cell]]
label = "$R^2$"
cell = "%(n::r_squared).03f"

[[footer.row]]
label = "Sample"
cell = ["full", 'stations $\leq 27$', "stations $> 27$"]

Now we can run the R script and generate the regression table with tomltable:

$ Rscript example_mag.R
$ cat example_mag.toml \
    | tomltable \
        -j example_model_1.json \
        -j example_model_2.json \
        -j example_model_3.json \
        --title "Earthquake depth and magnitude" \
        --label tab:quakes \
        --human-readable-numbers \
    > example_mag.tex

Only generating a template

Use the --only-table option if you only want to generate the template, not the final table:

$ cat example_mag.toml \
    | tomltable \
        -j example_model_1.json \
        -j example_model_2.json \
        -j example_model_3.json \
        --title "Earthquake depth and magnitude" \
        --label tab:quakes \
        --only-template \
    > example_mag.tmpl

Note that the --human-readable-numbers option is dropped. This option is only used for generating the final table.

Using a template to generate a table

Use the --from-template option if you want to use a pre-specified template instead of generating the template from a TOML specification:

$ cat example_mag.tmpl \
    | tomltable \
        -j example_model_1.json \
        -j example_model_2.json \
        -j example_model_3.json \
        --from-template \
        --human-readable-numbers \
    > example_mag.tex

Note that the --title and the --label options are dropped. These options are only used for template generation.

Generating a regression table with column-specific coefficients

We will generate the following table:

Preview of example_mag_squared.tex

In the above table, the coefficient for Magnitude squared is only present in column (2). Because it is missing in the JSON file for column (1), we will need to call tomltable with the --ignore-missing-keys option.

To make the example complete, the following R script generates the results for column (2):

# example_mag_squared.R

library(fixest)
library(jsonwriter)

data(quakes)

write_json(
    feols(depth ~ mag + mag^2, quakes),
    "example_model_4.json")

The TOML specification for the table is similar to before. The most notable difference is in the new [[body.cell]] block for Magnitude squared:

# example_mag_squared.toml

[header]
add-column-numbers = true

[[header.row]]
cell = ["Depth", "Depth"]

[[body.cell]]
label = "Magnitude"
coef = "mag"

[[body.cell]]
label = "Magnitude squared"
coef = "I(mag^2)"

[[footer.cell]]
label = "$R^2$"
cell = "%(n::r_squared).03f"

[[footer.cell]]
label = "Observations"
cell = "%(n::nobs)d"

Running the R script and generating the table, tomltable gives us warning messages because Magnitude squared is missing for column (1). The missing elements in the table are simply omitted in the output, leaving some dangling text in the affected cells. To remove these, we use sed:

$ Rscript example_mag_squared.R
$ cat example_mag_squared.toml \
    | tomltable \
        -j example_model_1.json \
        -j example_model_4.json \
        --title "Earthquake depth and magnitude" \
        --label tab:quakes \
        --human-readable-numbers \
        --ignore-missing-keys \
    | sed \
        -e "s/ & \$\$/ \& /" \
        -e "s/ & ()/ \& /" \
    > example_mag_squared.tex
warning: Specifier '%(1::coef::I(mag^2)::est).03f' refers to key '1::coef::I(mag^2)::est' but this key is not in the JSON object.
warning: Specifier '%(1::coef::I(mag^2)::stars)s' refers to key '1::coef::I(mag^2)::stars' but this key is not in the JSON object.
warning: Specifier '%(1::coef::I(mag^2)::se).04f' refers to key '1::coef::I(mag^2)::se' but this key is not in the JSON object.

Notice that to generate this table, we only had to estimate one new regression specification, the one whose results we saved in example_model_4.json for column (2). For column (1), we reused the results that we already had in example_model_1.json.

Author

Gabor Nyeki. Contact information is on https://www.gabornyeki.com/.

tomltable's People

Contributors

gn0 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.