Giter Site home page Giter Site logo

Comments (18)

foost avatar foost commented on September 25, 2024

from the Skype call:

  • present current paper and results, discuss
  • suggestions and best practice guide
  • hands-on practical work
  • think about how or what participants can submit, and whether and how there will be a review

from reproducible-agile.github.io.

nuest avatar nuest commented on September 25, 2024
  • submit code and data sample beforehand for organizers to review/prepare
  • not just become a pre-conference conference

from reproducible-agile.github.io.

nuest avatar nuest commented on September 25, 2024

A draft for the proposal is ready for contributions at https://github.com/o2r-project/agile-2017/blob/master/public/files/agile2018_proposal.md

from reproducible-agile.github.io.

nuest avatar nuest commented on September 25, 2024

more ideas: let's have a "hands-on" morning, and a more theoretical afternoon?

But there is a worth in having a more conceptual/theoretical discussion, might attract different people. Can we move that to some evening event?

Ideas for hands-on sessions

  • How to write a reproducible paper with R markdown
  • How to write a reproducible lab notebook with Jupyter
  • scripting GIS workflows in QGIS and ArcGIS

from reproducible-agile.github.io.

nuest avatar nuest commented on September 25, 2024

TUD thinks about RDM workshop, get back to them.

from reproducible-agile.github.io.

foost avatar foost commented on September 25, 2024

150 minutes seem little time to teach enough for participants to be able to do anything meaningful.
do you mean to split up the group as per expressed interests?

from reproducible-agile.github.io.

cgranell avatar cgranell commented on September 25, 2024

the practical proposal is ok, but I'd like to have content more catchy in the sense of "actually" reproducing two published papers (one in R, the other in python). Using these running examples, participants will try to reproduce them, face actual problems in doing so, and at the same time will learn practical aspects of reproducibility in R (markdown, etc) and python (jupyter, ArcGIS scripting, etc)..

from reproducible-agile.github.io.

nuest avatar nuest commented on September 25, 2024

@cgranell So you suggest to prepare two papers and then go through the process of reproducing them? @MarkusKonk Has a lot of experience in that :-)

I like the idea because it is much better to prepare than a "bring your own data" approach.

Yes, I suggest to split up the groups. I would also strictly focus on the workflow/reproducibility aspects. So for R, we would assume people know R already, but introduce them to R Markdown (e.g. by creating an R Markdown from a PDF-paper with a script as @cgranell suggests) and even the rticles package so you have the publication ready for submission, and upload it to Zenodo Sandbox at the end.
The same for Python+?-GIS: We're not teaching GIS, but show how you can construct a reproducible workflow instead of point-and-click.

@foost Would you prefer a full-day workshop? We can also cut down on the time for "Reproducibility at AGILE", just make that 15 mins. without discussion, would bring up the time for hands-on to 3 hrs.

from reproducible-agile.github.io.

nuest avatar nuest commented on September 25, 2024

I've updated the proposal a little bit to mention "publications", because we're not introducing workflow tools etc, but do have a focus on reproducible papers / reproducible publications.

This would shift the content for the GIS-stuff a bit: R Markdown also supports chunks in other programming languages, so it should be no problem to create a reproducible paper using QGIS/any Python library with it :-).

from reproducible-agile.github.io.

MarkusKonk avatar MarkusKonk commented on September 25, 2024

Right, in the last months we reproduced several papers that were published by Copernicus. It was pretty interesting not only because of the different types of errors but also regarding the output of the code, i.e. the figures which were rarely equal to those reported in the paper and oftentimes quite far away from that. I am currently discussing that in another paper but we can discuss it in the workshop, too.

from reproducible-agile.github.io.

cgranell avatar cgranell commented on September 25, 2024

Yes, that's the idea for the hands-on session

Let's start simple. Imagine we identify a paper that 's elegible to be reproduced in R. We can split the participants into groups so that all groups try to reproduce the same paper, following the workflow you specified before. The session might combine brief explanations (how-to use R markdown, how-to...) mixed with group work. Spontaneous discussion are encouraged to discuss problems that groups face while getting data, finding scripts, downloading scripts, solving execution errors, etc... Each group also write down what they are doing (based on a form we provide). For example, if finding data was easy, difficult , etc; how long does it take (in minutes), which score (0-3) was given etc. During the last half an hour, we can reflect on the difficulties to reproduce a paper, supporting tools, and best practices. As a result, we would get an experimental assessment of our proposed criteria for reproducible research, which can be input for going on out work.

Additionally, some of us might take the role of "observer" to monitor and take notes about participants' behaviour while trying to reproduce papers ...I mean, reproducibility is not only technology; it requires mind shift, a different way of doing research, create a habit, which can be studied from a social perspective too, complemented the previous form .

Other configurations might be possible...
Case 2: choose 2 papers and half of participants reproduce one , the other half the second paper.

If the above R experiment worked fine , we might extend it to python too, perhaps as a reproducibility course as part of the AGILE PhD School

Too ambitious??

from reproducible-agile.github.io.

nuest avatar nuest commented on September 25, 2024

I would not make this into a user study (how easy they find things) but tend to a more controlled version. But that are details we can figure out later. In the workshop description we could put something along the lines of

Participants explore the practical principles of reproducible papers by reproducing a provided real-world publication. Together with the instructors they create a reproducible document from public text/code/data and publish it in a data repository (and in a code repository/GitHub?).

We (@nuest @MarkusKonk) could do that for R. I do fear that if we use only R we might loose participants, so it would be great if we could offer the same (either with a Jupyter Notebook, or with Markdown + Python) for Python users. The Python example might include ArcGIS/QGIS, but we would for now not mention that.

@cgranell Totally agree on the "mind shift" aspect. We should try to teach a different way of working rather than teaching many features of tool X.

@foost What do you think? We might not appeal "regular GIS users", because you should have some experience in either R or Python, but having that as a prerequisite could solve your concerns wrt time.

We should eat our own dogfood and make the teaching material part of a GitHub repo und CC-BY, so of course AGILE PhD school is a reasonable future step to teach this.

from reproducible-agile.github.io.

MarkusKonk avatar MarkusKonk commented on September 25, 2024

To me, it sounds interesting.

from reproducible-agile.github.io.

hoferb avatar hoferb commented on September 25, 2024

I agree completely with changing to reproducing existing work - or looking into how to make a 'research package' from existing material. Seems to me that both sides of the coin can be discussed.

(I just tried to reproduce the R script Daniel prepared but failed early on, when installing a package did not work. Very annoying. I would enjoy to learn more about reproducition using R Markdown ;)

from reproducible-agile.github.io.

nuest avatar nuest commented on September 25, 2024

Description updated, feedback welcome (especially on the newly added "Objectives" section).

from reproducible-agile.github.io.

MarkusKonk avatar MarkusKonk commented on September 25, 2024

Hi,
The proposal reads well.
Just a few comments on the description:

  • "newcomers" is probably not the right word
  • "...a short description of experience in R/Python" - do we have to say that they cannot attent if they are not familiar with R/Python? How about explicitly mentioning expected skills?
  • "...a summary of previously conducted or currently running computational work"
  • preferred hands-on session (R or Python)

from reproducible-agile.github.io.

nuest avatar nuest commented on September 25, 2024

I have updated the workshop proposal because we were a lot above the 1000 character limit for the workshop abstract, which I forgot to put in my template... anyways, we do have a longer text ready for the website now :-)

I also uploaded the PDF for submission, just in case somebody wants to do a final check: https://github.com/o2r-project/agile-2017/blob/master/public/files/agile2018_workshop_proposal_reproducible-research-publications-at-AGILE.pdf

from reproducible-agile.github.io.

nuest avatar nuest commented on September 25, 2024

Workshop was submitted and accepted, see https://agile-online.org/programme-2018/agile-workshops-2018

image

from reproducible-agile.github.io.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.