Your Ad Here

Posts Tagged ‘distributed’

Google App Engine is in Beta – Cloud Competition Is Arriving for Amazon

Tuesday, April 8th, 2008

Google App Engine is in beta and was announced today. They were supposed to release this week and they did.

What is the Google App Engine?

It is actually a bit different than I expected. I expected these services to directly compete with S3, EC2, SimpleDB and they do but as of right now you can only run Google App Engine apps on their servers. Amazon Web Services you can run on any server which makes for a much more flexible usage into existing apps. Not everything yet can be hosted in the cloud and having the access to store locally or in the cloud is key, right now Amazon has that right model for it. Amazon isn’t trying to be your authentication service or push you into other services, it is simply an extension of your domain. However Google’s offerings are very compelling here and threaten lots more markets than just amazon and salesforce but also hosting companies.

Google is looking like they are trying to take over the domain which might work but I am more for the parallel development. For instance, if you wanted to run storage on Google but computing on EC2 could you do that with Google’s setup, no. With Amazon you can swap out S3, EC2, SimpleDB for other things if you want. I like the componentized model much much better. Hopefully Google moves in the component model as well as their hosted solutions, this is very nice for upstart and getting things moving quickly though. Google does want to become the base.

On the other side, SalesForce is probably freaking out because if you use Google Apps for your business, this just adds in the element of business apps that can be run for entire small business infrastructure with Google Apps and any custom apps needed with Google App Engine. But they probably will adapt to use this within their systems, if it was more componentized and available outside Google App Engine.

I will post more on this when it has been through the wringer. As of right now Google’s service is extremely beta, limited on their servers and only available for small traffic. When they release pricing and more information we will see where the market falls for cloud based storage and computing.

A few things I really like about the new Google offerings is simplicity. They are using Python which arguably has some of the best toolkits out there for cloud development right now (boto being one of them for Amazon). They are pretty much using Pure Python and you can push up whatever libraries you want. They have Django and I will see if other templating engines like pyTenjin work up there. But also their configuration is in a simple format. Simple user formats I am fond of are Markdown and YAML. This is how you configure an app for Google App Engine, in YAML (Yet Another Markup Language):

application: myapp
version: 1
runtime: python
api_version: 1
 
handlers:
- url: /
script: home.py
 
- url: /index.html
script: home.py
 
- url: /stylesheets
static_dir: stylesheets
 
- url: /(.*\.(gif|png|jpg))
static_files: static/\1
upload: static/(.*\.(gif|png|jpg))
 
- url: /admin/.*
script: admin.py
login: admin
 
- url: /.*
script: not_found.py

However I see Amazon still being the champ here when comparing the current publicly known offerings. Hopefully Google is just testing their engines and infrastructure in a limited capacity and will open it up to componentization to run from anywhere, much like OpenSocial or other services like RESTful, XML-RPC, JSON-RPC etc. As of right now Google App Engine is too closely coupled for integration into many systems residing on servers not on Google’s environment.

I think Google’s App Engine service more closely matches the SalesForce.com type hosted SaaS rather than true cloud technologies just yet. It is a bit of a mix of both.

It sure is an interesting time seeing and participating in another game changing announcement and new pardigm and bend in the software market from local to cloud computing and storage.

Google has some nice setup though with these great Python libraries running for your disposal:

In addition to the Python standard library and the App Engine libraries, the runtime environment includes the following third-party libraries:

They are using django templates, these are pretty nice I have taken a liking to Mako and pyTenjin but django framework is quite nice.

Here is a video walk through of the application run through. Basic Python templating app essentially in django.

Google App Engine Walkthrough

YouTube Preview Image

Guido Speaks

YouTube Preview Image

The Software and Web Cloud

Thursday, April 3rd, 2008

What is the web and software cloud?  

Software and data storage has always evolved, and quite rapidly in technology.  From the old punch cards, to disks, to hard drives and now a new type of storage is emerging, the cloud.

Distributed computing based on the internet, the ultimate self-healing network, is now reaching a level where services that provide distributed data storage are thriving.  I will give you three areas that these types of services are happening and why you see this area of technology is set for leaps and bounds.

  • Media and Digital Asset Storage (who doesn’t use S3 in at least some projects?)
  • Software Configuration Management (SCM) Distributed Source Control (Git, Mercurial, etc)
  • Personal Data

Media and Digital Asset Storage

Streaming media, online TV, youtube, website media servers are all part of the digital asset storage movement into the software and web cloud.  Amazon S3 is a very commonly used service for this where they charge cents on the GB per data used.  This allows only used bandwidth to be charges and a bit of parallelism in your hosting setup.  If one part of your hosting (yoru files) are hosted on Amazon S3 then if your site gets slammed with traffic the largest burden is on Amazon’s Servers that can scale.  The data and content representation can be driven by your servers and purr like a kitten through a digg or reddit front page.

Software Configuration Management (SCM)

Subversion is popular now and it is the 80% source control system of the day (many thanks to Google Code using it).  But a new type of source control management has been in effect in version control systems (VCS) in such source control platforms as Git and Mercurial.  These non centralized systems are unlike Subversion, CVS, VSS which are centralized system. Centralized systems have benefits to management but also fail the distributed and redundancy issues that might arise without a distributed source control system.
Services like gitorious and github are services in the cloud handling your distributed cloud based source control system so you don’t have to manage either.

Personal Data

Email, documents, spreadsheets, photo storage, videos and many other areas of personal data and life are moving online.  Gmail, Google Docs, Zoho, and many others are making online office a reality.  Many people switched in 2007 to online office and you can bet that trend will continue.  Your documents are safer at google than on your local drives.
Data is no longer stored locally for many things.  There are no longer on a specific drive or disk that is physical in your reach.  These are stored on a server somewhere, out there.  Out there becomes the cloud, one ominous, massive collection of data.  The only way to find it is with semantics and authentication to gain access to it.  I believe that the cloud will make personal data much more manageable that matches people’s brain.  Google keyword searching and semantic uris are two key elements that I think mimic the brains behavior to perform more efficient. Storing your data this way is natural.  Have you tried gmail search compared to outlook, see a difference?  

And More

These are just briefly some of the technologies to watch and especially when it comes to infrastructure.  I think the trend is towards more offsite, redundant, cloud based computing and storage not only for cost savings in maintenance and management but also to allow growth to be in line with visitors or usage.  No reason to pay for servers for peak performance all the time.  Some services might only have a once a year spike like maybe taxes or the “The Big Game” with the NFL.  This way costs can be better defined and growth or spikes just increase the cloud capacity for handling them.
Elastic Computing for instance like Amazon EC2 initiative where server capacity is available on demand with the spike in traffic and expands or contracts based on need.  
Amazon is actually quite far ahead in the web services and cloud markets.  They have a strong offering and their stock jump in 2007 may have been warranted.
There are many more facets to the software web cloud that is being constructed and used heavily that will be appearing here so stay tuned.

drawcloud – cloud tracking, service and product development is proudly powered by WordPress
Entries (RSS) and Comments (RSS).
Your Ad Here Your Ad Here