Friday, November 25, 2011

Some great resources on the web covering Machine Learning, Data Mining, R, Python and other topics

These are just a few web resources around Python, R, machine learning, data mining, and some other technical topics I've come across. I will update this list as time permits. In the mean time..enjoy!!

Some Good R videos
A collection of talks given by Hadley Wickham @hadleywickham on R
Stanford OpenClassroom - a bunch of CS classes Stanford. Full Courses Short Videos
List of freely available programming books

KDNuggets - latest KD news
Khan Academy application of machine learning to assess student mastery
CMU Machine Learning course

Quick Python Facts
Invent Your Own Computer Games with Python

Courtesy of @jeremyphoward
Getting in Shape for the sport of Data Science

Courtesy of @hackingdata
UCB Intro to Data Science
A good list of Machine learning, Statistical computing related courses
A great video with Jeff Hammerbacher detailing the future of Big Data

Courtesy of @peteskomoroch
Hidden Videos Courses in Math, Science and Engineering
Updated List of Datasets and Video Lectures



Saturday, September 10, 2011

Project Timbutku is now Texas A&M Data Wranglers !!!

It's official, Project Timbuktu is now Texas A&M Data Wranglers (TADW). TADW is now an officially approved student organization. We are still going through the recognition process and will be wrapping that up in the next week.

It's quite exciting to see we have come this far. I've been talking to a couple of people about TADW and I'm quite encouraged with the feedback I'm getting.

Now the real work begins.. find some industry speakers, sponsors, and funding.

By the way we are on the web, on twitter @txdatawranglers and linkedin

For more information about our student organization see this PDF

Sunday, July 10, 2011

What I'm doing this Summer

The long summer is upon us and I'm taking the much needed break from school. Though I'm still keeping really busy. This is a rundown of some of the things I'll be doing / working on this summer.
  • I'm interning at a startup company in the employment science space @rezscore as a Data Scientist Intern, They employ statistical and scientific techniques to grade resumes and match them to job descriptions. The free resume grading service also offers suggestions on how you could make some improvements to your resume. I'm having to implement supervised and unsupervised machine learning and data mining algorithms to classify and score the resumes.  It's been a lot of fun. I'm doing most of my work in Python and use R, SQL, Excel and SAS as needed to clean and structure data. (I will blogging at length later about techniques and steps in the data cleaning / structuring process and other tips and trick  figured out along the way)
  • I will be attending Scipy 2011 in Austin, TX . I'm really looking forward to the conference as this is my first professional Python conference, and this will also give me the chance to reconnect with friends in Austin.
  • I an working on creating a student club at A&M from Fall 2011. The objective and vision of the organization will be to bring like-minded students together in a collaborative environment to work on interesting data science / Big Data problems. I'm tagging this, Project Timbuktu for now.  Jumping through administrative hoops, finding an advisor and securing funding for the organization will not be an easy task especially in the current climate where departments are slashing budgets and expenses. More on  Project Timbuktu later.
  • I will also be attending  PyTexas 2011. This is actually at the end of the summer / early September in College Station
  • I attended a startup lessons / Lean startup conference at Austin Tech Ranch (startup incubator) early in the summer. This was basically a day long telecast conference which featured several startup big hitters. More on this later.


Tuesday, April 12, 2011

Hello World !!

For this post, I would like to introduce myself and thank you for stopping by. I'm Ike and I am currently a graduate student at Texas A&M where I am pursing a masters degree in Industrial and Systems Engineering with a focus on computational sciences and data mining.

The exciting world of data is here to stay and I do believe that data analytics is the next frontier in the information age. This point is underscored with the recent emergence of data analytics in social media, healthcare,  IT and business.

It's all about the data, and the organizations that are able to leverage this, will remain the few that will continue to evolve and revolve to stay relevant in the marketplace.

I will be blogging about my thoughts on collective Intelligence, data wrangling, data mining, predictive modeling, social media, start-ups, analytics and also pen down ideas, things I hope not to forget, projects I'm currently working on and experiences.

I'm really excited about this venture.