Research Data Management (RDM) is essential for responsible research, and planning should begin early. Well-organized, well-documented data will meet funding agency requirements and will be preserved, discoverable, and reproducible.
This seminar will review what RDM is, how it applies to your research, and who to contact for assistance.
Turn messy data into tidy data! Much of your time will be spent in this ‘data wrangling’ stage. It’s not the most fun, but it’s necessary, and good data organization is the foundation of data analysis. Learn the rules for a ‘tidy dataset’, with examples and tools, so you can clean and prepare your data.
This virtual session will teach you how to think about data organization and introduce some practices for more effective data wrangling. Note that we will not discuss analysis or visualization.
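As a rough illustration of what "tidy" usually means (each variable in its own column, each observation in its own row), the sketch below reshapes a small wide table into a long one using only the Python standard library. The table contents and the helper name `to_tidy` are hypothetical and not taken from the session materials.

```python
# Hypothetical "messy" table: one row per site, one column per year.
wide_rows = [
    {"site": "A", "2019": 12, "2020": 15},
    {"site": "B", "2019": 7, "2020": 9},
]


def to_tidy(rows, id_key, var_name, value_name):
    """Reshape wide rows into long rows, one observation per row."""
    tidy = []
    for row in rows:
        for column, value in row.items():
            if column == id_key:
                continue  # the identifier is repeated on every output row
            tidy.append({id_key: row[id_key], var_name: column, value_name: value})
    return tidy


tidy_rows = to_tidy(wide_rows, id_key="site", var_name="year", value_name="count")
for record in tidy_rows:
    print(record)
```

In the reshaped form, "year" becomes a variable with its own column rather than being spread across column headers, which is typically easier to filter, group, and join.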
Final_v2_rev3_final_FINAL.docx got you down? Version tracking, or version control, is the management of changes to any file or document. Version control is like a savings program for your project. Implementing a file versioning strategy at the beginning of your research project will help prevent confusion among collaborators and save the time and effort otherwise lost trying to recover the "right" version of a file.
This virtual session will review a few ways to track different versions of a file, focusing on using git for version control.
What's in a name? File naming, when done in a well-organized fashion, can contribute to project documentation, workflow organization, and sharing. Moreover, certain choices in file naming are essential to accessing and sharing files across computing systems. Establishing naming conventions for your files and using them consistently will ensure maximum access to your data and records.
This virtual session will review how file naming conventions will save you time by keeping your work organized and understandable.
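One way to keep a convention consistent is to check file names against it automatically. The sketch below assumes a hypothetical convention, `YYYY-MM-DD_project_description_vNN.ext`, with lowercase letters, digits, and hyphens only. The pattern and the function name `follows_convention` are illustrative assumptions, not a convention recommended by the session.

```python
import re

# Hypothetical convention: YYYY-MM-DD_project_description_vNN.ext
# Lowercase letters, digits, and hyphens only; no spaces or special characters.
WORD = r"[a-z0-9]+(?:-[a-z0-9]+)*"
PATTERN = re.compile(
    r"\d{4}-\d{2}-\d{2}_" + WORD + "_" + WORD + r"_v\d{2}\.[a-z0-9]+"
)


def follows_convention(filename):
    """Return True if the file name matches the assumed convention."""
    return PATTERN.fullmatch(filename) is not None


for name in [
    "2020-01-15_survey_raw-data_v01.csv",
    "Final_v2_rev3_final_FINAL.docx",
    "2020-01-15 survey data.csv",
]:
    print(name, follows_convention(name))
```

Leading with an ISO-style date means the files sort chronologically in a directory listing, and avoiding spaces keeps the names usable across operating systems and command-line tools.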
To ensure that you understand your own data and to enable others to find, use and properly cite your data, it helps to add ‘documentation’ or ‘metadata’ (data about data) to the documents and datasets you create.
This virtual session will explore the critical role documentation plays in data management and how you can ensure good documentation throughout your research.
Lamont Library B-30 Collaborative Learning Space 11 Quincy St Cambridge
Workshop sponsored by the Harvard University Digital Scholarship Support Group
Are you interested in using data visualizations to explore your data or as part of your research output, but unsure of where to start? Are you already using data viz, but want to learn to create more effective presentations with different applications or programming languages?
Consider attending Visual Eloquence, a participatory workshop on visualizing data and understanding the powerful role it plays in analysis and presentation for digital scholarship. The...
Winter 2020 Research Data Management Seminar Series
Neglecting to develop and implement a detailed naming convention for your data files. Down the slope. Failing to sync and back up your data in three separate locations. Data slip up. Saving your data to a proprietary file format that is on the verge of insolvency. Data danger zone. Facing the shame of having your publication retracted due to data irregularities. The steepest of slopes.
Data management is not easy, as is evidenced by these true...
Harvard University CGIS South Building 1730 Cambridge Street Cambridge, MA 02138
A Data Science Bootcamp for better research
DataFest is a two-day workshop held during J-term and developed by data practitioners and researchers from across the university. Hosted by the Institute for Quantitative Social Science (IQSS), the workshop helps participants develop skills in working with data. It is structured around the data lifecycle and includes lectures and hands-on sessions focused on data concepts, project planning and data science workflows, processing and analysis, and finally...