This should be the final new follow-up thread from the community meeting yesterday.

The premise of this working group is that Pangeo’s scope need not stop at “merely” providing software tools that help do science more openly, but can extend to trying to disrupt the ways in which scientific publishing itself is not open.

Some ideas we could discuss:

What have we done well

  • Tools to make (Zarr) data actually available, not just “upon reasonable request”

What could be better

  • Cost models for archiving this data
  • Web-based visualization of uploaded datasets
  • Automated software / dataset citation network
  • More nuanced models of credit
    • Best practices around how to fairly give credit to contributors in open-source software and other aspects of open collaboration
  • Moving away from Jupyter Notebooks in repos as a publication format

I have insider knowledge that @jbusecke and @paigem are writing out some thoughts on this topic already…

Really interested in this topic over here :wave: and the work we’ve been doing with MyST Markdown is really applicable. Instead of notebooks-in-repos as a publication format, using MyST means researchers can get a really nice web experience up easily, with visualisations available in place via a Binder hookup. Rowan and I were at AGU and able to catch up with @paigem and Max, and we’d love to be involved in getting something set up to support this.

Just to add a couple of examples for folks who aren’t familiar