Jenny Palomino

Jenny Palomino has contributed to the materials listed below. Jenny is an earth data science course developer and instructor with the Earth Analytics Education Initiative at Earth Lab.

Course Lessons

Course lessons are developed as a part of a course curriculum. They teach specific learning objectives associated with data and scientific programming. Jenny Palomino has contributed to the following lessons:

Use Twitter Data to Explore the 2013 Colorado Flood Using Open Source Python

In this lesson you will learn how to parse a JSON file containing twitter data to better understand the 2013 Colorado Floods using Open Source Python tools.

Open and Use MODIS Data in HDF4 format in Open Source Python

MODIS is remote sensing data that is stored in the HDF4 file format. Learn how to open and use MODIS data in HDF4 form in Open Source Python.

Introduction to the HDF4 Data Format - Explore H4 Files Using HDFView

MODIS is remote sensing data that is stored in the HDF4 file format. Learn how to view and explore HDF4 files (and their metadata) using the free HDF viewer provided by the HDF group.

Find and Download MODIS Data From the USGS Earth Explorer Website

Learn how to find and download MODIS data from the USGS Earth Explorer website.

Work with MODIS Remote Sensing Data using Open Source Python

MODIS is a satellite remote sensing instrument that collects data daily across the globe at 250-500 m resolution. Learn how to import, clean up and plot MODIS data in Python.

Customize Dates on Time Series Plots in Python Using Matplotlib

When you plot time series data using the matplotlib package in Python, you often want to customize the date format that is presented on the plot. Learn how to customize the date format on time series plots created using matplotlib.

Resample or Summarize Time Series Data in Python With Pandas - Hourly to Daily Summary

Sometimes you need to take time series data collected at a higher resolution (for instance many times a day) and summarize it to a daily, weekly or even monthly value. This process is called resampling in Python and can be done using pandas dataframes. Learn how to resample time series data in Python with Pandas.

Subset Time Series By Dates Python Using Pandas

Sometimes you have data over a longer time span than you need for your analysis or plot. Learn how to subset your data using a begin and end date in Python.

Work With Datetime Format in Python - Time Series Data

Python provides a datetime object for storing and working with dates. Learn how you can convert columns in a pandas dataframe containing dates and times as strings into datetime objects for more efficient analysis and plotting.

Work With Datetime Format in Python - Time Series Data

Python provides a datetime object for storing and working with dates. Learn how you can convert columns in a pandas dataframe containing dates and times as strings into datetime objects for more efficient analysis and plotting.

Write Functions with Multiple Parameters in Python

A function is a reusable block of code that performs a specific task. Learn how to write functions that can take multiple as well as optional parameters in Python to eliminate repetition and improve efficiency in your code.

Write Functions in Python

A function is a reusable block of code that performs a specific task. Learn how to write functions in Python to eliminate repetition and improve efficiency in your code.

Introduction to Writing Functions in Python

A function is a reusable block of code that performs a specific task. Learn how functions can be used to write efficient and DRY (Do Not Repeat Yourself), code in Python.

Automate Data Tasks With Loops in Python

Loops can be used to automate data tasks in Python by iteratively executing the same code on multiple data structures. Learn how to automate data tasks in Python using data structures such as lists, numpy arrays, and pandas dataframes.

Introduction to Using Loops to Automate Workflows in Open Source Python

Loops can help reduce repetition in code by iteratively executing the same code on a range or list of values. Learn about the basic types of loops in Python and how they can be used to write Do Not Repeat Yourself, or DRY, code in Python.

Conditional Statements with Alternative or Combined Conditions

Conditional statements in Python can be written to check for alternative conditions or combinations of multiple conditions. Learn how to write conditional statements in Python that choose betweeen alternative conditions or check for combinations of conditions before executing code.

Intro to Conditional Statements in Python

Conditional statements help you to control the flow of code by executing code only when certain conditions are met. Learn about the structure of conditional statements in Python and how they can be used to write Do Not Repeat Yourself, or DRY, code in Python.

Practice Forking a GitHub Repository and Submitting Pull Requests

A pull request allows anyone to suggest changes to a repository on GitHub that can be easily reviewed by others. Learn how to submit pull requests on GitHub.com to suggest changes to a GitHub repository.

An Example of a Github Collaborative Workflow for Team Science

GitHub.com can be used to store and access files in the cloud using GitHub repositories. Learn how to submit pull requests on GitHub.com to suggest changes to a GitHub repository.

Track, Manage and Discuss Project Changes and Updates Using GitHub Issues

An issue is a GitHub project management tool that allows anyone to identify and discuss potential changes to a repo. Learn how to create and manage GitHub issues to support collaborative open reproducible science projects.

Sync a GitHub Repo: How To Ensure Your GitHub Fork Is Up To Date

When you are working on a forked GitHub repository you will need to update your files frequently. Learn how to update your GitHub fork using a reverse pull request.

How To Create A Pull Request on Github: Propose Changes to GitHub Repositories

A pull request allows anyone to suggest changes to a repository on GitHub that can be easily reviewed by others. Learn how to submit pull requests on GitHub.com to suggest changes to a GitHub repository.

Learn How To Use GitHub to Collaborate on Open Science Projects

GitHub is a website that supports git-based version control and collaborative project management. Learn how to use git and GitHub to collaborate on projects in support of open reproducible science.

Select Data From Pandas Dataframes

Pandas dataframes are a commonly used scientific data structure in Python that store tabular data using rows and columns with headers. Learn how to use indexing and filtering to select data from pandas dataframes.

Run Calculations and Summary Statistics on Pandas Dataframes

Pandas dataframes are a commonly used scientific data structure in Python that store tabular data using rows and columns with headers. Learn how to run calculations and summary statistics (such as mean or maximum) on columns in pandas dataframes.

Import CSV Files Into Pandas Dataframes

Pandas dataframes are a commonly used scientific data structure in Python that store tabular data using rows and columns with headers. Learn how to import text data from .csv files into numpy arrays.

Intro to Pandas Dataframes

Pandas dataframes are a commonly used scientific data structure in Python that store tabular data using rows and columns with headers. Learn about the key characteristics of pandas dataframes that make them a useful data structure for storing and working with labeled scientific datasets.

Slice (or Select) Data From Numpy Arrays

Numpy arrays are an efficient data structure for working with scientific data in Python. Learn how to use indexing to slice (or select) data from one-dimensional and two-dimensional numpy arrays.

Run Calculations and Summary Statistics on Numpy Arrays

Numpy arrays are an efficient data structure for working with scientific data in Python. Learn how to run calculations and summary statistics (such as mean or maximum) on one-dimensional and two-dimensional numpy arrays.

Import Text Files Into Numpy Arrays

Numpy arrays are an efficient data structure for working with scientific data in Python. Learn how to import text data from .txt and .csv files into numpy arrays.

Intro to Numpy Arrays

Numpy arrays are a commonly used scientific data structure in Python that store data as a grid, or a matrix. Learn about the key characteristics of numpy arrays that make them an efficient data structure for storing and working with large scientific datasets.

Use the OS and Glob Python Packages to Manipulate File Paths

The os and glob packages are very useful tools in Python for accessing files and directories and for creating lists of paths to files and directories, respectively. Learn how to manipulate and parse file and directory paths using os and glob.

Write Code That Will Work On Any Computer: Introduction to Using the OS Python Package to Set Up Working Directories and Construct File Paths

Manually constructed files paths will often not run on computers with different operating systems. Learn how to construct file paths in Python that will work on Mac, Linux and Windows, in support of open reproducible science.

Working Directories, Absolute and Relative Paths and Other Science Project Management Terms Defined

A directory refers to a folder on a computer that has relationships to other folders. Learn about directories, files, and paths, as they relate to creating reproducible science projects.

Install Packages in Python

Packages in Python provide pre-built functionality that adds to the functionality available in base Python. Learn how to install packages in Python using conda environments.

Use Conda Environments to Manage Python Dependencies: Everything That You Need to Know

A conda environment is a self contained Python environment that allows you to run different versions of Python (with different installed packages) on your computer. Learn how to conda environments can you help manage Python packages and dependencies.

Python Packages for Earth Data Science

The Python programming language provides many packages and libraries for working with scientific data. Learn about key Python packages for earth data science.

Scientist's Guide to Plotting Data in Python Textbook | Earth Lab CU Boulder

Customize Your Plots Using Matplotlib

Matplotlib is the most commonly used plotting library in Python. Learn how to customize the colors, symbols, and labels on your plots using matplotlib.

Introduction to Plotting in Python Using Matplotlib

Matplotlib is the most commonly used plotting library in Python. Learn how to create plots using the matplotlib object oriented approach.

DRY Code and Modularity

DRY (Do Not Repeat Yourself) code supports reproducibility by removing repetition and making code easier to read. Learn about key strategies to write DRY code in Python.

Make Your Code Easier to Read By Using Expressive Variable Names in Python

Expressive variable names refer to function and variable names that describe what the variable contains or what the function does. Using expressive names makes your code easier to understand. Learn how to create expressive names for objects in your Python code.

Clean Code Syntax for Python: Introduction to PEP 8 Style Guide

Using a standard format and syntax when programming makes your code easier to read. Learn more about PEP 8, a set of guidelines for writing clean code in Python.

Introduction to Writing Clean Code and Literate Expressive Programming

Clean code refers to writing code that runs efficiently, is not redundant and is easy for anyone to understand. Learn about the characteristics and benefits of writing clean, expressive code in Python.

Basic Operators in Python

Operators are symbols in Python that carry out a specific computation, or operation, such as arithmetic calculations. Learn how to use basic operators in Python.

Lists in Python

A Python list is a data structure that stores a collection of values in a specified order (or sequence) and is mutable (or changeable). Learn how to create and work with lists in Python.

Variables in Python

Variables store data (i.e. information) that you want to re-use in your code (e.g. single numeric value, path to a directory or file). Learn how to to create and work with variables in Python.

Introduction to the Python Scientific Programming Language for Earth Data Science

Python is a free, open source programming language that can be used to work with scientific data. Learn about using Python to develop scientific workflows.

Format Text In Jupyter Notebook With Markdown

Markdown allows you to format text using simple, plain-text syntax and can be used to document code in a variety of tools, including Jupyter Notebook. Learn how to format text in Jupyter Notebook using Markdown.

Text File Formats for Earth Data Science

There are many text file formats that are useful for earth data science workflows including Markdown, text (.txt, .csv) files, and YAML (Yet Another Markup Language). Learn about these common text file formats for earth data science workflows.

Useful Jupyter Notebook Shortcuts

The Jupyter ecosystem contains many useful tools for working with Python including Jupyter Notebook, an interactive coding environment. Learn useful shortcuts in Jupyter Notebook that can help you complete your tasks quickly and efficiently.

Manage Jupyter Notebook Files

The Jupyter ecosystem contains many useful tools for working with Python including Jupyter Notebook, an interactive coding environment, and the Jupyter Notebook dashboard, which allows you to manage files and directories in your Jupyter environment. Learn how to manage Jupyter Notebook files including saving, renaming, deleting, moving, and downloading notebooks.

Manage Directories in Jupyter Notebook Dashboard

The Jupyter ecosystem contains many useful tools for working with Python including the Jupyter Notebook dashboard, which allows you to manage files and directories in your Jupyter environment. Learn how to create, rename, move, and delete directories using the Jupyter Notebook dashboard.

Code and Markdown Cells in Jupyter Notebook

The Jupyter ecosystem contains many useful tools for working with Python including Jupyter Notebook, an interactive coding environment. Learn how to work with cells, including Python code and Markdown text cells, in Jupyter Notebook.

Get Started With Jupyter Notebook For Python

The Jupyter ecosystem contains many useful tools for working with Python including Jupyter Notebook, an interactive coding environment. Learn how to launch and close Jupyter Notebook sessions and how to navigate the Jupyter Dashboard to create and open Jupyter Notebook files (.ipynb).

Introduction to Jupyter For Python

The Jupyter ecosystem contains many useful tools for working with Python including Jupyter Notebook, an interactive coding environment. Learn how the components and functionality of Jupyter Notebook can help you implement open reproducible science workflows.

Bash Commands to Manage Directories and Files

Bash or Shell is a command line tool that is used in open science to efficiently manipulate files and directories. Learn how to run useful Bash commands to access and manage directories and files on your computer.

Introduction to Bash (Shell) and Manipulating Files and Directores at the Command Line

Bash or Shell is a command line tool that is used in open science to efficiently manipulate files and directories. Learn how to use Bash to manipulate files in support of reproducible science.

How To Organize Your Project: Best Practices for Open Reproducible Science

Open reproducible science refers to developing workflows that others can easily understand and use. Learn about best practices for organizing open reproducible science projects including the use of machine readable names.

Tools For Open Reproducible Science

Key tools for open reproducible science include Shell (Bash), git and GitHub, Jupyter, and Python. Learn how these tools help you implement open reproducible science workflows.

What Is Open Reproducible Science

Open reproducible science refers to developing workflows that others can easily understand and use. It enables you to build on others' work rather than starting from scratch. Learn about the importance and benefits of open reproducible science.

How to Join Attributes From One Shapefile to Another in Open Source Python Using Geopandas: GIS in Python

A spatial join is when you assign attributes from one shapefile to another based upon its spatial location. Learn how to perform spatial joins in Python.

How Do You Design and Automate a Data Workflow

Designing and developing data workflows can help you complete your work more efficiently by allowing you to repeat and automate data tasks. Learn how to design and develop efficient workflows to automate data analyses in Python.

Learn to Write Pseudocode for Python Programming

Pseudcode can help you design data workflows through listing out the individual steps of workflow in plain language, so the focus is on the overall data process, rather than on the specific code needed. Learn best practices for writing pseudocode for data workflows.

Data Workflow Best Practices - Things to Consider When Processing Data

Identifying aspects of a workflow that can be modularized and tested can help you design efficient and effective data workflows. Learn best practices for designing efficient data workflows.

Introduction to Open Source Software - What Is It and How Can You Help?

Open source means that you can view and contribute to software code like packages you use in Python. Learn about the ways that you can contribute without being an expert progammer.

Practice Using Git and GitHub to Manage Files

Practice your skills setting up git locally, committing changes to files and pushing and pulling files to GitHub.com

Undo Local Changes With Git

A version control system allows you to track and manage changes to your files. Learn how to undo changes in git after they have been added or committed to version control.

Get Started with Git Commands for Version Control

A version control system allows you to track and manage changes to your files. Learn how to use some basic Git commands including add, commit and push.

How To Setup Git Locally On Your Computer

Learn how to setup git locally on your computer.

Copy (Fork) and Download (Clone) GitHub Repositories

GitHub.com can be used to store and access files in the cloud to share with others or simply as a backup of your local files. Learn how to create a copy of files on GitHub (fork) and to download files from GitHub to your computer (clone).

What Is Version Control

A version control system allows you to track and manage changes to your files. Learn benefits of version control for scientific workflows and how git and GitHub.com support version control.

Activity on Dry Code

This activity provides an opportunity to practice writing DRY code using loops, conditional statements, and functions.

Guided Activity on Git/Github.com For Collaboration

This lesson teaches you how to collaborate with others in a project, including tasks such as notifying others that an assigned task has been completed.

Guided Activity on Undo Changes in Git

This lesson teaches you how to undo changes in Git after they have been added or committed.

Crop a Spatial Raster Dataset Using a Shapefile in Python

This lesson covers how to crop a raster dataset and export it as a new raster in Python

How to Dissolve Polygons Using Geopandas: GIS in Python

In this lesson you review how to dissolve polygons in python. A spatial join is when you assign attributes from one shapefile to another based upon its spatial location.

How to Reproject Vector Data in Python Using Geopandas - GIS in Python

Sometimes two shapefiles do not line up properly even if they cover the same area because they are in different coordinate reference systems. Learn how to reproject vector data in Python using geopandas to ensure your data line up.

Get Started With GIS in Open Source Python - Geopandas, Rasterio & Matplotlib

There are a suite of powerful open source python libraries that can be used to work with spatial data. Learn how to use geopandas, rasterio and matplotlib to plot and manipulate spatial data in Python.

Set Up Your Conda Earth Analytics Python Environment

Conda environments allow you to easily manage the Python package installations on your computer. Learn how to install a conda environment using a yml file.

Setup Git, Bash, and Conda on Your Computer

Learn how to install Git, GitBash (a version of command line Bash) and the Miniconda Python distribution on your computer.

Setup Your Earth Analytics Python, Git, Bash Environment On Your Computer

There are several core tools that are required to work with data. These include Shell/Bash, Git/Github and Python. Learn how to set all of these tools up on your computer so you can work with different types of data using open science workflows.

GEOG 4463 & 5463 - Earth Analytics Bootcamp: Pre-Course Checklist

GEOG 4463 & 5463 - Earth Analytics Bootcamp: Pre-Course Learning Resources

GEOG 4463 & 5463 - Earth Analytics Bootcamp: August 2018 Syllabus

Earth Analytics Bootcamp Course | Earth Lab CU Boulder

Introduction to Earth Data Science | Earth Lab CU Boulder

Overlay Raster and Vector Spatial Data in A Matplotlib Plot Using Extents in Python

When plotting raster and vector data together, the extent of the plot needs to be defined for the data to overlay with each other correctly. Learn how to define plotting extents for Python Matplotlib Plots.

Interactive Maps in Python

Folium is a Python package that can be used to create interactive maps in Jupyter Notebook. Learn how to create interactive maps with raster overlays in Python using Folium.

Analyze The Sentiment of Tweets From Twitter Data and Tweepy in Python

One way to analyze Twitter data is to analyze attitudes (or sentiment) in the tweet text. Learn how to analyze sentiments in Twitter data using open source Python.

Analyze Co-occurrence and Networks of Words Using Twitter Data and Tweepy in Python

One common way to analyze Twitter data is to identify the co-occurrence and networks of words in Tweets. Learn how to analyze word co-occurrence (i.e. bigrams) and networks of words using Python.

Analyze Word Frequency Counts Using Twitter Data and Tweepy in Python

One common way to analyze Twitter data is to calculate word frequencies to understand how often words are used in tweets on a particular topic. To complete any analysis, you need to first prepare the data. Learn how to clean Twitter data and calculate word frequencies using Python.

Programmatically Accessing Geospatial Data Using APIs

This lesson walks through the process of retrieving and manipulating surface water data housed in the Colorado Information Warehouse. These data are stored in JSON format with spatial x, y information that support mapping.

Introduction to Working With JSON Data in Open Source Python

This lesson introduces how to work with the JSON data structure using Python using the JSON and Pandas libraries to create and convert JSON objects.

Introduction to APIs

API's allow you to automate access and downloading data in your code to support open reproducible science. Learn how how to use API's to download data from the internet using open source python.

Data tutorials

Nothing to list here yet!