{"id":171,"date":"2019-07-10T04:04:22","date_gmt":"2019-07-09T22:34:22","guid":{"rendered":"https:\/\/techieshouts.com\/?p=171"},"modified":"2022-08-09T19:08:30","modified_gmt":"2022-08-09T13:38:30","slug":"what-is-bigdata","status":"publish","type":"post","link":"https:\/\/techieshouts.com\/home\/what-is-bigdata\/","title":{"rendered":"What is BigData?"},"content":{"rendered":"\n<p>What is BigData? It is the first question that comes to mind when someone sets out to learn Hadoop and other distributed processing tools. <\/p>\n\n\n\n<p>There is no end to learning about any technology, because technology keeps growing every day.<\/p>\n\n\n\n<div class=\"wp-block-ub-notification-box\"><div class=\"ub_notify_info\"><p class=\"ub_notify_text\" style=\"text-align:left\"> Over the last decade, there has been an exponential increase in data in every sector. One estimate puts it at 2.5 exabytes per day. <\/p><\/div><\/div>\n\n\n\n<p>Companies like Facebook, Twitter, and Google already generate petabytes of data every day.<\/p>\n\n\n\n<div class=\"wp-block-ub-table-of-contents ub_table-of-contents\" data-showtext=\"show\" data-hidetext=\"hide\"><div class=\"ub_table-of-contents-header\"><div class=\"ub_table-of-contents-title\">Contents<\/div><\/div><div style=\"display:block\" class=\"ub_table-of-contents-container ub_table-of-contents-1-column\"><ul><li><a href=\"#0-what-is-big-data\">What is Big data?<\/a><\/li><li><a href=\"#1-various-sources-for-databigdata\">Various sources for data(BigData)<\/a><\/li><li><a href=\"#1--types-of-data-\">Types of Data<\/a><\/li><li><a href=\"#3--attributes-that-describe-bigdata-\">Attributes that describe Bigdata<\/a><\/li><\/ul><\/div><\/div>\n\n\n\n<h2 id=\"0-what-is-big-data\">What is Big data?<\/h2>\n\n\n\n<blockquote class=\"wp-block-quote is-style-default\"><p>&#8220;Big data are nothing but a collection of large and complex data sets that are difficult to process using traditional database management tools or traditional data 
processing applications&#8221;<\/p><\/blockquote>\n\n\n\n<p>As the complexity of the data increases, the efficiency of traditional tools decreases. <\/p>\n\n\n\n<h2 id=\"1-various-sources-for-databigdata\">Various sources for data(BigData)<\/h2>\n\n\n\n<p>Sources include stock exchanges (1 TB), smartphones, YouTube uploads (48 hours of video every minute), social networks like Twitter and Facebook (more than 10 TB of data daily), Instagram, and 30 million network sensors across the globe generating data throughout the day.<\/p>\n\n\n\n<p>We can categorize this rapidly growing data into a few types.<\/p>\n\n\n\n<h2 id=\"1--types-of-data-\"><strong>Types of Data<\/strong><\/h2>\n\n\n\n<ul><li><strong>Structured<\/strong> \u2013 Structured data has a proper schema, like the tables in an RDBMS.<\/li><li><strong>Semistructured<\/strong> \u2013 Semi-structured data has its own structure, but it doesn\u2019t have a detailed schema for storing the data. Examples include XML and JSON.<\/li><li><strong>Unstructured<\/strong> \u2013 Unstructured data has no defined structure, only the raw data itself. Examples include weblogs and anti-virus logs.<\/li><\/ul>\n\n\n\n<p>Big data is a broad term for large-volume or complex data sets that are difficult to process using simple data-management tools and applications.<\/p>\n\n\n\n<h2 id=\"3--attributes-that-describe-bigdata-\"><strong>Attributes that describe Bigdata<\/strong><\/h2>\n\n\n\n<ul><li><strong>Variety <\/strong>\u2013 This refers to the different types of complex data being generated, such as sensor readings, stock exchange updates, and social network data.<\/li><li><strong>Velocity <\/strong>\u2013 This is the pace at which data is written and retrieved for processing.<\/li><li><strong>Volume <\/strong>\u2013 This is the amount of data that is stored over a period of time.<\/li><\/ul>\n\n\n\n<p>So, handling this much data is a definite problem, isn\u2019t it? 
\u201cBig data\u201d is a problem statement, and one solution for handling it is a framework called \u201cHadoop\u201d.<\/p>\n\n\n\n<p><strong>Hadoop <\/strong>is a framework that addresses this problem with its components. When you think about handling BigData, two main concerns come to mind.<\/p>\n\n\n\n<ol><li>Storage<\/li><li>Processing<\/li><\/ol>\n\n\n\n<ul><li>HDFS (Hadoop Distributed File System) takes care of storage.<\/li><li>MapReduce (MR) takes care of processing.<\/li><\/ul>\n\n\n\n<p><\/p>\n\n\n\n<iframe loading=\"lazy\" src=\"https:\/\/docs.google.com\/presentation\/d\/e\/2PACX-1vR-rQqsDvUKJzwpwvxbX65ayUgoSfZd8uTvfUF9Yic-X7iLi881jpPztGGD4JMIRJg0DwXrbd_VFYIl\/embed?start=false&amp;loop=false&amp;delayms=3000\" frameborder=\"0\" width=\"640\" height=\"389\" allowfullscreen=\"true\" mozallowfullscreen=\"true\" webkitallowfullscreen=\"true\"><\/iframe>\n\n\n\n\n\n<p>Also, check &#8220;<a href=\"https:\/\/techieshouts.com\/what-is-hadoop\/\">What is Hadoop?<\/a>&#8221;<\/p>\n\n\n\n<p>Reference &#8211;  <a href=\"https:\/\/docs.microsoft.com\/en-us\/sql\/t-sql\/queries\/select-into-clause-transact-sql?view=sql-server-2017\" target=\"_blank\" rel=\"noopener\">BigData wiki<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>What is BigData? It is the first question that comes to mind when someone sets out to learn Hadoop and other distributed processing tools. There is no end to learning about any technology, because technology keeps growing every day. 
Companies like Facebook, Twitter, and Google already generate petabytes of data every day.\u2026 <span class=\"read-more\"><a href=\"https:\/\/techieshouts.com\/home\/what-is-bigdata\/\">Read More &raquo;<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":289,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[11],"tags":[26,25,48,49],"_links":{"self":[{"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/posts\/171"}],"collection":[{"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/comments?post=171"}],"version-history":[{"count":16,"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/posts\/171\/revisions"}],"predecessor-version":[{"id":560,"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/posts\/171\/revisions\/560"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/media\/289"}],"wp:attachment":[{"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/media?parent=171"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/categories?post=171"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/techieshouts.com\/home\/wp-json\/wp\/v2\/tags?post=171"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}