Using the Amazon Cloud
Intro
Please fill in the blanks on this page as you start using Amazon Web Services.
Cost
Pietro's guidance from the lab meeting on 10/21/2008: compare the cost to a $1000 plane ticket to a conference. Spending $200-300 is reasonable; $10K is not.
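One way to apply this rule of thumb before launching a job is to estimate the bill from the number of instances, the hours they run, and the hourly rate. The sketch below is only illustrative: the function names are made up for this page, the $0.10 per instance-hour rate is a placeholder assumption rather than an actual AWS price, and the $300 limit is simply the upper end of the range quoted above.

```python
def estimate_cost(num_instances, hours, rate_per_hour):
    """Rough cost estimate: instances x hours x hourly rate (in dollars)."""
    return num_instances * hours * rate_per_hour


# Upper end of the "reasonable" range from the guidance above.
REASONABLE_LIMIT = 300.0


def is_reasonable(cost, limit=REASONABLE_LIMIT):
    """Return True if the estimated cost is within the reasonable range."""
    return cost <= limit


if __name__ == "__main__":
    # Hypothetical job: 20 instances for 48 hours at an assumed $0.10/hour.
    cost = estimate_cost(20, 48, 0.10)
    print(f"Estimated cost: ${cost:.2f}, reasonable: {is_reasonable(cost)}")
```

Check the current AWS price list for the real hourly rate, and remember that storage and data transfer are billed separately from instance hours.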
Tools
- Elasticfox Firefox Extension for Amazon EC2
- Hadoop is an open-source implementation of Google's MapReduce framework for distributing data and processing across a large number of machines.
- MapReduce: Simplified Data Processing on Large Clusters. Jeffrey Dean and Sanjay Ghemawat. 2004. PDF
- Map-Reduce for Machine Learning on Multicore. Chu et al. NIPS 2007. PDF
- Google News Personalization: Scalable Online Collaborative Filtering. Das et al. WWW 2007. PDF
- Hadoop on Amazon EC2
- Amazon Elastic MapReduce
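To make the MapReduce model behind Hadoop more concrete, here is a minimal single-process sketch of the classic word-count example in plain Python. It does not use Hadoop or any AWS service; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are illustrative only. A real framework runs the map and reduce steps in parallel on many machines and performs the grouping step over the network.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce sequentially over a list of documents."""
    pairs = []
    for doc in documents:
        pairs.extend(map_phase(doc))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    docs = ["the cloud is elastic", "the cluster is large"]
    print(word_count(docs))
```

Because each call to `map_phase` depends only on its own document, and each call to `reduce_phase` only on its own key, both steps can be spread across machines without coordination, which is what makes the model a good fit for a rented cluster.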
Datasets
You can ask Amazon to host public datasets for free access through Public Datasets on AWS.
Tips and Tricks
Please share your tips, tricks and hacks.