Summary: With more than 600 improvements and new features, MongoDB version 2.2 is a rounding out of a product that is no longer bleeding edge, but leading edge.
MongoDB version 2.2 is a rounding out of the product that includes a significant number of new lines of code and more than 600 new features. The latest release includes an advanced aggregation framework, new multi-data center deployment features, and more.
10gen has more than 175 people worldwide, with the goal of reaching 200 employees by the end of the year. They recently received an infusion of $74 million, and are backed by In-Q-Tel, a firm in turn backed by the U.S. Central Intelligence Agency. In-Q-Tel invests in technology that the CIA wants to use, to build specific features for them, or to build a community. 10gen also has received funding from NEA, Sequoia, Union Square Ventures and Flybridge.
Keyhole was one of their most famous endeavors: "Founded in 2001, was a pioneering software development company specializing in geospatial data visualization applications and was acquired by Google in 2004." Ultimately, Keyhole became Google Earth.
MongoDB is a NoSQL database offering built in C++, which allows you to support many languages. So, a SQL statement like,
Changes to:
So, what's new in MongoDB version 2.2?
"MongoDB 2.2 has been a huge effort to make the database even easier to use and operate," says Eliot Horowitz, 10gen co-founder and chief technology officer. "We think that moving to NoSQL should make you a more productive software engineer, and features like the aggregation framework deliver on that promise."
Aggregation Framework
With the accumulation of massive amounts of data, you need tools to allow you to easily and quickly talk to the data through the enablement of real-time queries. This new release also simplifies reporting and provides the foundation for real-time analytics.
According to MongoDB, release 2.2 can accelerate performance of analytics and reporting up to 80 percent compared to using MapReduce. Finally, the enhanced aggregation framework is significantly easier to use and execute than when using MapReduce and offers new operators, new expressions, and a pipeline-processing framework.
Some of the new operators include
The net of it is that these new operators allow you to simplify your statements further.
New multi-data center features
In addition to being able to scale horizontally, via an auto-sharding architecture which includes load and data distribution, MongoDB can easily scale to up to 1,000 machines with no downtime. And with the new automatic fail-over, along with multi-data center features, allow MongoDB to:
Allowing MongoDB to place load intelligently by tagging by region is
an administrative operation that allows it to reshuffle data to
optimizing it based on geography thereby reducing latency. The main
advantage of being able to write to multiple data centers at one time,
of course, is that it provides fault tolerance. Additionally, without
the notion of geography, the tags may be used for other uses such as
archiving.
Improved concurrency
MongoDB 2.2 features a new locking architecture that improves performance for workloads that require frequent disk I/O operations. Users will see faster, more predictable performance from MongoDB deployments, particularly in deployments where disk I/O speed is a limiter.
The concurrency improvements allow for:
New features and improvements
There are hundreds of improvements in the latest release, including:
Check out MongoDB for your self at www.mongodb.org.
Is your organization using or planning to use MongoDB? Let me know.