Digital Fitness History

Posts

Pre-Processing Text for MALLET

**This post is the third in a set of 5 written by the Digital History Fellows at the Roy Rosenzweig Center for History and New Media. The original post can be found on the Digital History Fellowship Blog. In our previous post, we described the process of writing a python script that pulled from the THATCamp MySQL Database. In this post, we will continue with this project and work to clean up the data we’ve collected and prepare it for some analysis.

Design Using a Grid

The reading for this week in _Thinking with Type _about grids was extremely interesting to me. The section about tables and grids in html particularly peaked my interest. I think the hardest part of html and css is learning layout but I think a grid makes the whole idea of layout easier to understand. I’ve always used grids when I code, particularly I like the 960 grid system which makes layout very easy and allows you to work of a variety of grid layouts.

Extracting Data from the THATCamp Database Using Python and MySQL

*This post is the second in a set of 4 written by the Digital History Fellows at the Roy Rosenzweig Center for History and New Media. The original post can be found on the Digital History Fellowship Blog. This week we’ve continued to work on building a python script that will extract all of the blog posts from the various THATCamp websites. As Jannelle described last week, our goal was to write a script that downloads the blog posts in plain text form and strips all of the html tags, stopwords, and punctuation so that we can feed it into MALLET for topic modeling and text analysis.

Spring Semester in Research and a THATCamp Challenge

**This post is the first in a set of 4 written by the Digital History Fellows at the Roy Rosenzweig Center for History and New Media. The original post by Jannelle Legg can be found on the Digital History Fellowship Blog. The spring semester is here and the first year DH fellows have begun our rotation into the Research division of CHNM. To get the ball rolling, we spent a week working through the helpful tutorials at theProgramming Historian.

Typographical Design and Communication

This week’s materials on Typography were really interesting. I especially enjoyed Morris’s article «Hear, All Ye People; Hearken, O Earth.» I’ve always been interested in Typography but I had never realized the extent to which a typeface can affect the credibility of a site. Typography is one of the most important, if not the most important, element in the design of a website yet it’s something that we rarely consciously think about.