Jason Favrod

Pandas Basics I: Series and DataFrames

What is Pandas?

Pandas is a free software (software libre) data analysis library for the Python programming language. The library provides analysts and programmers with data structures optimized for working with large data sets, and methods for examining and manipulating that data. It uses another free software library, NumPy, for its underlying data structures, and Matplotlib's pyplot interface to generate plots such as graphs and histograms.
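As a rough illustration of the two data structures named in the title, here is a minimal sketch. The weekday labels and the temp_c and humidity values are made up for the example and do not come from the post.

```python
import pandas as pd

# A Series is a one-dimensional array of values with an index of labels.
temperatures = pd.Series(
    [21.5, 23.0, 19.8],
    index=["Mon", "Tue", "Wed"],
    name="temp_c",
)

# A DataFrame is a two-dimensional table; each column is a Series
# and all columns share the same index.
readings = pd.DataFrame(
    {"temp_c": [21.5, 23.0, 19.8], "humidity": [40, 35, 50]},
    index=["Mon", "Tue", "Wed"],
)

# Look up a single value by its label.
tuesday_temp = temperatures["Tue"]

# Keep only the rows that satisfy a condition (boolean indexing).
warm_days = readings[readings["temp_c"] > 20]

print(tuesday_temp)
print(warm_days)
```

Calling readings.describe() on the same DataFrame gives summary statistics for each numeric column, which is a quick way to examine a new data set.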

Splitting a PDF into Single Pages

I recently had a request to take a PDF file which contained multiple documents, one page each, and separate the documents out into individual files. The request was in a Microsoft Windows environment. I had used Ghostscript for such tasks in the past on Linux, and was pleased to find the software was also available for DOS-based systems.
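The excerpt does not show the command that was used, so the following is only a sketch of one way to drive Ghostscript from Python. It assumes the 64-bit Windows console executable gswin64c is on the PATH; the helper names, the file name combined.pdf, and the output pattern page_%03d.pdf are hypothetical.

```python
import subprocess


def build_split_command(input_pdf, output_pattern="page_%03d.pdf",
                        executable="gswin64c"):
    """Build a Ghostscript command that writes one PDF per input page.

    A %d-style placeholder in the output file name makes Ghostscript
    start a new output file for each page.
    """
    return [
        executable,
        "-dBATCH",            # exit when processing is finished
        "-dNOPAUSE",          # do not prompt between pages
        "-sDEVICE=pdfwrite",  # write PDF output
        f"-sOutputFile={output_pattern}",
        input_pdf,
    ]


def split_pdf(input_pdf, **options):
    """Run Ghostscript; raises CalledProcessError if it exits non-zero."""
    subprocess.run(build_split_command(input_pdf, **options), check=True)


command = build_split_command("combined.pdf")
print(command)
```

Calling split_pdf("combined.pdf") would then run the command. On Linux the executable is usually just gs, which can be passed as executable="gs".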


Printing ASP.NET Debugging Output

For ASP.NET development, I use Microsoft’s Visual Studio IDE. In any development environment, I regularly use print statements to discover and verify what my code actually does. In Java, simply printing to the console (with System.out.println) achieves this aim, but with ASP.NET and Visual Studio I tried the same approach (using Console.WriteLine) to no avail.


PHP date Function ISO 8601 Format String

ISO 8601 with difference from Greenwich time (UTC).

Git Frequently Used Commands

Git is a version control system used to keep track of changes in code bases, including configuration files. I have found the following commands useful.

SQL to Pandas Translation

I’m experienced in working with SQL for data wrangling and analysis, but have recently started using the Python Pandas library for similar tasks. What I really like about Pandas is the ability, combined with matplotlib, to plot and visualize the data once it has been curated. Coming from a SQL background, I’ve been approaching problems by thinking in terms of SQL, so I’m documenting here some translations between SQL and Pandas queries. I’ll try to keep updating this as I continue to use Pandas.
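To give a flavor of such translations, here is a small sketch that pairs two common SQL statements with one possible Pandas equivalent each. The orders table, its columns, and its values are invented for the example and are not taken from the post.

```python
import pandas as pd

# Hypothetical table standing in for a SQL "orders" table.
orders = pd.DataFrame(
    {
        "customer": ["ann", "bob", "ann", "cy", "bob"],
        "amount": [20.0, 5.0, 30.0, 15.0, 7.5],
    }
)

# SELECT customer, amount FROM orders WHERE amount > 10;
large_orders = orders.loc[orders["amount"] > 10, ["customer", "amount"]]

# SELECT customer, SUM(amount) AS total
# FROM orders GROUP BY customer ORDER BY total DESC;
totals = (
    orders.groupby("customer", as_index=False)["amount"]
    .sum()
    .rename(columns={"amount": "total"})
    .sort_values("total", ascending=False)
)

print(large_orders)
print(totals)
```

The WHERE clause becomes a boolean mask passed to .loc, and GROUP BY with an aggregate becomes groupby followed by the aggregation method. Passing as_index=False keeps customer as a regular column, which is closer to what a SQL result set looks like.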
