Projects

Hierarchical Confidence Based Clustering

This was a group project I was part of for my Machine Learning class under Dr. Isbell. Each group developed their own semester long project and for the final submission, wrote a NIPS style paper and presented their work.

One of the difficulties of Machine Learning and classification in specific is the curse of dimensionality. This is basically the idea that the more features/dimensions you have to classify on the more computation is needed. To alleviate this a lot of time can be invested in trying to fine tune the features used on a particular learning algorithm.

The idea behind our project was to exploit the internal workings of Neural Networks to automatically perform feature selection and additionally reduce overall computation time.
We had some promising results that gave us better accuracies than a well performing statistical algorithm on our data sets. Our results were not a large improvement though, so this is one area further work would focus on. Even without a huge accuracy gain though, our method provides a mechanism for parallelization of Neural Network learning.

A copy of our final paper is available in pdf.

Features:

  • Feature Selection: Dynamic feature selection for the given dataset.
  • Parallelization: Multiple classifiers opens the possibilities for improved computation time.
Anatrove.net
Anatrove is my pet project. The idea behind it is the ability to make a smarter decision about how you purchase your music. The only differences between Amazon.com and iTunes are delivery time and the extras you get with a physical cd: Liner notes, lyric sheets, art, stories etc. Some albums have much in this area and others do poorly. Why spend time and money when you do not get anything extra?

This site is a way for me to keep up on various skills that I do not get in my job. This site is wholly implemented by me and no WYSIWYG tools were used. The layout you see was designed by my friend Rachel Knickmeyer, who is an HCI designer; the wireframe design she created can be found in her portfolio.

The backend of the site is done in perl with mySQL used for the user and album database. The front end is HTML which is dynamically generated by the server. Most recently, I have added AJAX to the main page to allow for dynamic navigation of random albums.

Features:

  • AJAX: Dynamic random album navigation with "grow" only containing divs.
  • Email Form Protection: SQL injection protection, negative captcha to detect spam bots.
  • Sign up/Login: Random keys stored in cookies to validate the client is the same as the one that signed up or logged in.
Chronological Photo Set Map
This is my first use of Google Maps and one of my more personally usable apps. I store all my photos on flickr; however, at the time, I had no way to access them in a visually useful way. I decided this would be a good way to learn about Google maps and how to work with them.

The map pinpoints are fairly basic with the intricacy coming into play with the addition of chronology to the map. Each icon has a different color based on two ranges of time. Old to less recent photos are shades of grey from light to dark, and more recent photos are transitions from dark grey to green. Through these two mechanisms I was able to create an appealing view of my photo sets with an additional information visualization.

Features:

  • Geography: Photo sets, with preview and link to flickr, found in the location they were taken, allowing for easier and more interesting navigation.
  • Chronology: Dynamic color coding adds extra information to the visualization.
Dallas Area Meetup Member Map
This project was created to help the members of my local Flickr photography group. The map plots each user at coordinates they specify ( allows them to anonymize themselves as desired ). Each marker then contains the users icon and flickr name, both linked back to their flick accounts.

One additional feature I added was a weighted geographic center, so that members could use this information in addition to the general geography to help plan good location for meetups.

Features:

  • Geography: Each user plotted at locations for their choosing. Markers show their user icon and name linked to their flickr account.
  • Group Center: Dynamically calculated center of the entire group.
  • Group List: All members on the map are listed below it, and if clicked show on the map.