2. Distributed Filesystem
Apache HDFS
Red Hat GlusterFS
NoSQL Databases
Apache Hbase
Apache Cassandra
Key-Value Data Model
Redis DB
LinkedIN Voldermort
Distributed Filesystem
Apache HDFS
Red Hat GlusterFS
Distributed Programming
Apache MapReduce
Apache Pig
Document Data Model
MongoDB
RethinkDB
Graph Data Model
ArangoDB
TitanDB
Distributed Filesystem
Apache HDFS
Red Hat GlusterFS
Here is a limited list of the BigData Ecosystem
@EdPimentl
3. Data Ingestion
Apache Flume
Apache Storm
Scheduling
Apache Falcon
Apache Oozie
System Development
Apache Ambari
Cloudera HUE
Apache Mesos
Service Programming
Apache Zookeeper
LinkedIn Norbert
Twitter Elephant Bird
Machine Learning
WEKA
Cloudera Oryx
Apache Mahout
Others
Accumulo
SQL-on-Hadoop
Apache Hive
Apache Drill
Here is a limited list of the BigData Ecosystem
@EdPimentl
4. What is a Byte, Kilobyte, Megabyte, Gigabyte, Terabyte, Petabyte, and Exabyte?
Bytes(8 bits)
0.1 bytes:A binary decision
Kilobyte (1000 bytes)
2 Kilobytes:A Typewritten page
Megabyte (1 000 000 bytes)
2 Megabytes:A high resolution photograph
Gigabyte (1 000 000 000 bytes)
1 Gigabyte:A pickup truck filled with paper OR A symphony in high-fidelity sound OR A movie at TV quality
Terabyte (1 000 000 000 000 bytes)
10 Terabytes:The printed collection of the US Library of Congress
Petabyte (1 000 000 000 000 000 bytes)
2 Petabytes:All US academic research libraries
20 Petabytes: Production of hard-disk drives in 1995
Exabyte (1 000 000 000 000 000 000 bytes)
5 Exabytes:All words ever spoken by human beings
Nice description by Julian Bunn
5. Related Links
Open Data will hit every industry sector within 10 years https://lnkd.in/eBbzTY7
http://blog.knuthaugen.no/2010/03/a-brief-history-of-nosql.html
http://www.zdnet.com/article/traditional-databases-vs-the-threat-from-in-memory-nosql/?_escaped_fragment_=#!
http://arstechnica.com/information-technology/2013/07/the-hot-new-technology-in-big-data-is-decades-old-sql/
@EdPimentl