Giter Site home page Giter Site logo

citavir's Introduction

CitaviR

Project Status: Active โ€“ The project has reached a stable, usable state and is being actively developed. Lifecycle: maturing

This is an unofficial helper package for dealing with Citavi.
I am not affiliated with Citavi, just a fan.

Citavi (Official Website, Official GitHub) is a software program for reference management and knowledge organization. When working with local Citavi projects (as opposed to cloud or server projects) you can directly work on the (database stored in) the .ctv6 file via SQL. CitaviR provides functionality for

  1. reading the data from the .ctv6 file
  2. dealing with the data while it is outside Citavi to get the most out of it
  3. writing/updating the data into the .ctv6 file

Installation

You can install the development version of CitaviR from GitHub:

devtools::install_github('SchmidtPaul/CitaviR')

Example

You can find an example workflow on the Get Started page.

citavir's People

Contributors

hadley avatar schmidtpaul avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Forkers

jimsforks dl0s

citavir's Issues

detect_country()

Making use of countrycode::codelist, detect the country from keyword, title and/or abstract like so:

read_Citavi_xlsx(example_xlsx('3dupsin5refs.xlsx')) %>%
   detect_country()
  • write function
  • write tests
  • provide macro to import results into Citavi

find_obvious_dups()

Find obvious duplicates based on title and year and add gained information as columns like so:

read_Citavi_xlsx(example_xlsx('3dupsin5refs.xlsx')) %>%
   find_obvious_dups()
  • write function
  • write tests
  • write in example of README.Rmd
  • provide macro to import results into Citavi

read_Citavi_xlsx()

Import like so:

import_path <- example_xlsx('3dupsin5refs.xlsx')
CitDat <- read_Citavi_xlsx(import_path ) 
  • write function
  • write tests
  • write in example of README.Rmd
  • use col_types = for all columns based on the Column Info Master of #5
  • do we need str_replace_all(Categories, "\\s+", " ") for all text columns?
  • prevent warnings Coercing text to numeric for setSuggestedColTypes = FALSE

write_Citavi_xlsx()

Export processed CitDat to Excel with modified path like so:

import_path <- example_xlsx('3dupsin5refs.xlsx')

CitDat <- read_Citavi_xlsx(import_path ) %>%
   find_obvious_dups() %>%
   handle_obvious_dups()

write_Citavi_xlsx(CitDat, import_path)
  • write function
  • write tests
  • write in example of README.Rmd

handle_obvious_dups()

Take obvious duplicates, mark one as the "non-duplicate" and all others as the "duplicates" while making sure to extract all information across all duplicates and save it in the fields of the "non-duplicate"

read_Citavi_xlsx(example_xlsx('3dupsin5refs.xlsx')) %>%
   find_obious_dups() %>%
   handle_obvious_dups()
  • write function
  • write tests
  • write in example of README.Rmd
  • provide macro to import results into Citavi
  • document smart handling of Online-Address
  • provide a "default" or "all" list of fields for fieldsToHandle=

example datasets

Add more example datasets with these features:

  • is large enough that find_potential_dups() has to go trough if (NumberOfComp > maxNumberOfComp).
  • has different languages for detect_language()
  • has different country names in abstract, title, keyword for detect_country()
  • has nice entries for showing the smart handling of Online Address via handle_obvious_dups()

Column Info Master

For all columns Citavi can export, have a master file in CitaviR that lists

  • English names
  • names in all other languages
  • type (text or integer)
  • variable name for macro syntax

detect_language()

Make use of textcat to identify the language of the abstract like so:

read_Citavi_xlsx(example_xlsx('3dupsin5refs.xlsx')) %>%
   detect_language()
  • write function
  • write tests
  • provide macro to import results into Citavi

find_potential_dups()

Find potential duplicates based on title and year and add gained information as columns like so:

read_Citavi_xlsx(example_xlsx('3dupsin5refs.xlsx')) %>%
   find_obvious_dups() %>%
   find_potential_dups()
  • write function
  • write tests
  • write a vigniette
  • provide macro to import results into Citavi
  • add quiet = TRUE

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.