<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Big Data Archives | Cybiant</title>
	<atom:link href="https://www.cybiant.com/category/big-data/feed/" rel="self" type="application/rss+xml" />
	<link></link>
	<description>Win in the Data Economy</description>
	<lastBuildDate>Thu, 13 Jan 2022 08:48:09 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://www.cybiant.com/wp-content/uploads/2019/04/cropped-Logo-Cybiant-Site-Icon-32x32.png</url>
	<title>Big Data Archives | Cybiant</title>
	<link></link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Webinar: Enhancing PPP Performance in Asia Pacific</title>
		<link>https://www.cybiant.com/webinar-enhancing-ppp-performance-in-asia-pacific/</link>
					<comments>https://www.cybiant.com/webinar-enhancing-ppp-performance-in-asia-pacific/#respond</comments>
		
		<dc:creator><![CDATA[Cybiant]]></dc:creator>
		<pubDate>Tue, 04 Aug 2020 06:28:03 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Webinar]]></category>
		<category><![CDATA[PPP]]></category>
		<category><![CDATA[Public Private Partnerships]]></category>
		<guid isPermaLink="false">https://www.cybiant.com/?p=11856</guid>

					<description><![CDATA[<p>SEPTEMBER 3, 2020    11:00 AM - 12:00 PM SGT    As Public-Private-Partnerships schemes for the procurement of public infrastructure are growing quickly growing in popularity in Asia-Pacific, many governments are faced with the question how to professionalize their PPP Units or PPP Knowledge Hubs. Because of the  [...]</p>
<p>The post <a href="https://www.cybiant.com/webinar-enhancing-ppp-performance-in-asia-pacific/">Webinar: Enhancing PPP Performance in Asia Pacific</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p><div class="fusion-fullwidth fullwidth-box fusion-builder-row-1 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling" style="--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;" ><div class="fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap" style="max-width:1248px;margin-left: calc(-4% / 2 );margin-right: calc(-4% / 2 );"><div class="fusion-layout-column fusion_builder_column fusion-builder-column-0 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:1.92%;--awb-margin-bottom-large:0px;--awb-spacing-left-large:1.92%;--awb-width-medium:100%;--awb-spacing-right-medium:1.92%;--awb-spacing-left-medium:1.92%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-title title fusion-title-1 fusion-sep-none fusion-title-text fusion-title-size-four" style="--awb-margin-top-small:10px;--awb-margin-right-small:0px;--awb-margin-bottom-small:10px;--awb-margin-left-small:0px;"><h4 class="fusion-title-heading title-heading-left" style="margin:0;"><h1><iframe title="vimeo-player" src="https://player.vimeo.com/video/454306576" width="640" height="360" frameborder="0" allowfullscreen="allowfullscreen"></iframe></h1></h4></div></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-1 fusion_builder_column_1_2 1_2 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:50%;--awb-margin-top-large:0px;--awb-spacing-right-large:3.84%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:3.84%;--awb-width-medium:50%;--awb-spacing-right-medium:3.84%;--awb-spacing-left-medium:3.84%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><ul style="--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;--awb-circlecolor:var(--awb-color4);--awb-circle-yes-font-size:14.08px;" class="fusion-checklist fusion-checklist-1 fusion-checklist-default type-icons"><li class="fusion-li-item" style=""><span class="icon-wrapper circle-yes"><i class="fusion-li-icon fa-calendar-alt fas" aria-hidden="true"></i></span><div class="fusion-li-item-content">
<p style="letter-spacing: 3px;">SEPTEMBER 3, 2020</p>
</div></li></ul></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-2 fusion_builder_column_1_2 1_2 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:50%;--awb-margin-top-large:0px;--awb-spacing-right-large:3.84%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:3.84%;--awb-width-medium:50%;--awb-spacing-right-medium:3.84%;--awb-spacing-left-medium:3.84%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><ul style="--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;--awb-circlecolor:var(--awb-color4);--awb-circle-yes-font-size:14.08px;" class="fusion-checklist fusion-checklist-2 fusion-checklist-default type-icons"><li class="fusion-li-item" style=""><span class="icon-wrapper circle-yes"><i class="fusion-li-icon fa-clock fas" aria-hidden="true"></i></span><div class="fusion-li-item-content">
<p style="letter-spacing: 3px;">11:00 AM &#8211; 12:00 PM SGT</p>
</div></li></ul></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-3 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:1.92%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:1.92%;--awb-width-medium:100%;--awb-spacing-right-medium:1.92%;--awb-spacing-left-medium:1.92%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-1"><p>As <a href="https://www.cybiant.com/public-private-partnerships-training/">Public-Private-Partnerships</a> schemes for the procurement of public infrastructure are growing quickly growing in popularity in Asia-Pacific, many governments are faced with the question how to professionalize their PPP Units or PPP Knowledge Hubs.</p>
<p>Because of the long-term nature (and corresponding risks) of PPP projects, it is crucial that a sound framework drives the selection, appraisal and tender phases of PPP projects. This requires knowledge and experience of PPP professionals, as well as a sound understanding of the market conditions and financial markets. So how can governments improve their capabilities in a structured and efficient way?</p>
<p>In this webinar, Cybiant will provide an overview of their Public-Private-Partnership Maturity Index® (PPPMI). Developed by Cybiant, the PPP Maturity Index uses the CP3P certification as a knowledge base and provides an assessment that measures PPP framework maturity, breaking them down in critical capabilities that are required to improve PPP success. We will showcase which critical capabilities are required, and how they can be translated into structure improvement programs.</p>
<p>This webinar is hosted jointly by APMG-International and Cybiant. APMG-International developed the CP3P Certification framework in collaboration with the World Bank. Cybiant was the first accredited PPP training provider in Asia, and has certified hundreds of individuals in PPP Best Practices.</p>
</div></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-4 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:1.92%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:1.92%;--awb-width-medium:100%;--awb-spacing-right-medium:1.92%;--awb-spacing-left-medium:1.92%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-2"><p>This session will cover:<br />
&#8211; 45 minutes webinar<br />
&#8211; 15 minutes Q&amp;A</p>
<p><span class="morecontent">More information about Public Private Partnerships you can find <a href="https://www.cybiant.com/public-private-partnerships-training/">here</a>.</span></p>
</div><div class="fusion-text fusion-text-3"><p style="letter-spacing: 3px;">SPEAKERS:</p>
</div></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-5 fusion_builder_column_1_4 1_4 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:25%;--awb-margin-top-large:0px;--awb-spacing-right-large:7.68%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:7.68%;--awb-width-medium:25%;--awb-spacing-right-medium:7.68%;--awb-spacing-left-medium:7.68%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-4"><p><img decoding="async" class="aligncenter wp-image-3325" src="https://www.cybiant.com/wp-content/uploads/2019/09/Untitled-design-6-150x150.png" alt="" width="150" height="103" srcset="https://www.cybiant.com/wp-content/uploads/2019/09/Untitled-design-6-200x137.png 200w, https://www.cybiant.com/wp-content/uploads/2019/09/Untitled-design-6.png 238w" sizes="(max-width: 150px) 100vw, 150px" /></p>
</div></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-6 fusion_builder_column_1_2 1_2 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:50%;--awb-margin-top-large:0px;--awb-spacing-right-large:3.84%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:3.84%;--awb-width-medium:50%;--awb-spacing-right-medium:3.84%;--awb-spacing-left-medium:3.84%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-person person fusion-person-left fusion-person-1 fusion-person-icon-top" style="--awb-pic-style-color:var(--awb-color8);--awb-pic-borderradius:0px;--awb-margin-top:0px;--awb-margin-right:0px;--awb-margin-bottom:0px;--awb-margin-left:0px;"><div class="person-desc" style="background-color:var(--awb-color8);padding:40px;margin-top:0;"><div class="person-author"><div class="person-author-wrapper"><span class="person-name">Jan-Willem Middelburg</span><span class="person-title">CEO Cybiant</span></div><div class="fusion-social-networks"><div class="fusion-social-networks-wrapper"><a class="fusion-social-network-icon fusion-tooltip fusion-twitter awb-icon-twitter" aria-label="fusion-twitter" href="https://twitter.com/jwmiddelburg" target="_blank" rel="noopener noreferrer" style="color:#000000;font-size:16px;" data-placement="bottom" data-title="Twitter" title="Twitter" data-toggle="tooltip"></a><a class="fusion-social-network-icon fusion-tooltip fusion-linkedin awb-icon-linkedin" aria-label="fusion-linkedin" href="https://www.linkedin.com/in/jwmiddelburg/" target="_blank" rel="noopener noreferrer" style="color:#0077b5;font-size:16px;" data-placement="bottom" data-title="LinkedIn" title="LinkedIn" data-toggle="tooltip"></a></div></div></div><div class="person-content fusion-clearfix"></div></div></div></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-7 fusion_builder_column_1_4 1_4 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:25%;--awb-margin-top-large:0px;--awb-spacing-right-large:7.68%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:7.68%;--awb-width-medium:25%;--awb-spacing-right-medium:7.68%;--awb-spacing-left-medium:7.68%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-8 fusion_builder_column_1_4 1_4 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:25%;--awb-margin-top-large:0px;--awb-spacing-right-large:7.68%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:7.68%;--awb-width-medium:25%;--awb-spacing-right-medium:7.68%;--awb-spacing-left-medium:7.68%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-5"><p><img decoding="async" class="aligncenter wp-image-11865" src="https://www.cybiant.com/wp-content/uploads/2020/06/MK-APMG.png" alt="" width="103" height="103" srcset="https://www.cybiant.com/wp-content/uploads/2020/06/MK-APMG-66x66.png 66w, https://www.cybiant.com/wp-content/uploads/2020/06/MK-APMG-100x100.png 100w, https://www.cybiant.com/wp-content/uploads/2020/06/MK-APMG.png 150w" sizes="(max-width: 103px) 100vw, 103px" /></p>
</div></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-9 fusion_builder_column_1_2 1_2 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:50%;--awb-margin-top-large:0px;--awb-spacing-right-large:3.84%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:3.84%;--awb-width-medium:50%;--awb-spacing-right-medium:3.84%;--awb-spacing-left-medium:3.84%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-person person fusion-person-left fusion-person-2 fusion-person-icon-top" style="--awb-pic-style-color:var(--awb-color8);--awb-pic-borderradius:0px;--awb-margin-top:0px;--awb-margin-right:0px;--awb-margin-bottom:0px;--awb-margin-left:0px;"><div class="person-desc" style="background-color:var(--awb-color8);padding:40px;margin-top:0;"><div class="person-author"><div class="person-author-wrapper"><span class="person-name">MK Choong</span><span class="person-title">Regional Manager APMG International for the ASEAN and Japan region</span></div><div class="fusion-social-networks"><div class="fusion-social-networks-wrapper"><a class="fusion-social-network-icon fusion-tooltip fusion-twitter awb-icon-twitter" aria-label="fusion-twitter" href="https://twitter.com/APMG_Inter" target="_blank" rel="noopener noreferrer" style="color:#000000;font-size:16px;" data-placement="bottom" data-title="Twitter" title="Twitter" data-toggle="tooltip"></a><a class="fusion-social-network-icon fusion-tooltip fusion-linkedin awb-icon-linkedin" aria-label="fusion-linkedin" href="https://www.linkedin.com/in/mk-choong-48279041/" target="_blank" rel="noopener noreferrer" style="color:#0077b5;font-size:16px;" data-placement="bottom" data-title="LinkedIn" title="LinkedIn" data-toggle="tooltip"></a></div></div></div><div class="person-content fusion-clearfix"></div></div></div></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-10 fusion_builder_column_1_4 1_4 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:25%;--awb-margin-top-large:0px;--awb-spacing-right-large:7.68%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:7.68%;--awb-width-medium:25%;--awb-spacing-right-medium:7.68%;--awb-spacing-left-medium:7.68%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"></div></div></div></div><div class="fusion-fullwidth fullwidth-box fusion-builder-row-2 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling" style="--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;" ><div class="fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap" style="max-width:calc( 1200px + 0px );margin-left: calc(-0px / 2 );margin-right: calc(-0px / 2 );"></div></div><div class="fusion-fullwidth fullwidth-box fusion-builder-row-3 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling" style="--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;" ><div class="fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap" style="max-width:calc( 1200px + 0px );margin-left: calc(-0px / 2 );margin-right: calc(-0px / 2 );"><div class="fusion-layout-column fusion_builder_column fusion-builder-column-11 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:0px;--awb-margin-bottom-large:20px;--awb-spacing-left-large:0px;--awb-width-medium:100%;--awb-spacing-right-medium:0px;--awb-spacing-left-medium:0px;--awb-width-small:100%;--awb-spacing-right-small:0px;--awb-spacing-left-small:0px;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-separator fusion-full-width-sep" style="align-self: center;margin-left: auto;margin-right: auto;margin-top:20px;width:100%;"></div><div class="fusion-text fusion-text-6"><p style="letter-spacing: 3px;">MORE INFORMATION ON  PUBLIC PRIVATE PARTNERSHIPS TRAINING:</p>
</div><div class="fusion-image-element " style="--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);"><span class=" fusion-imageframe imageframe-none imageframe-1 hover-type-none"><a class="fusion-no-lightbox" href="https://www.cybiant.com/public-private-partnerships-training/" target="_self" aria-label="img cp3p badges"><img fetchpriority="high" decoding="async" width="750" height="250" src="https://www.cybiant.com/wp-content/uploads/2019/08/img-cp3p-badges.png" alt class="img-responsive wp-image-359" srcset="https://www.cybiant.com/wp-content/uploads/2019/08/img-cp3p-badges-200x67.png 200w, https://www.cybiant.com/wp-content/uploads/2019/08/img-cp3p-badges-400x133.png 400w, https://www.cybiant.com/wp-content/uploads/2019/08/img-cp3p-badges-600x200.png 600w, https://www.cybiant.com/wp-content/uploads/2019/08/img-cp3p-badges.png 750w" sizes="(max-width: 700px) 100vw, 750px" /></a></span></div></div></div></div></div><div class="fusion-fullwidth fullwidth-box fusion-builder-row-4 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling" style="--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;" ><div class="fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap" style="max-width:calc( 1200px + 0px );margin-left: calc(-0px / 2 );margin-right: calc(-0px / 2 );"></div></div></p>
<p>The post <a href="https://www.cybiant.com/webinar-enhancing-ppp-performance-in-asia-pacific/">Webinar: Enhancing PPP Performance in Asia Pacific</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.cybiant.com/webinar-enhancing-ppp-performance-in-asia-pacific/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Cybiant to host the first Enterprise Big Data Analyst virtual course</title>
		<link>https://www.cybiant.com/the-first-enterprise-big-data-analyst-virtual-course/</link>
					<comments>https://www.cybiant.com/the-first-enterprise-big-data-analyst-virtual-course/#respond</comments>
		
		<dc:creator><![CDATA[Cybiant]]></dc:creator>
		<pubDate>Mon, 27 Apr 2020 07:59:16 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Big Data Training]]></category>
		<category><![CDATA[enterprise big data framework]]></category>
		<guid isPermaLink="false">https://www.cybiant.com/?p=10675</guid>

					<description><![CDATA[<p>INTRODUCTION  What benefits can your organization achieve with Big Data? How to spot patterns in large quantities of enterprise data to improve performance and find underlying pattern that can benefit the organization? How can enterprises obtain a competitive advantage with Big Data? The Enterprise Big Data Analyst (EBDA) course discusses the theoretical concepts of Big  [...]</p>
<p>The post <a href="https://www.cybiant.com/the-first-enterprise-big-data-analyst-virtual-course/">Cybiant to host the first Enterprise Big Data Analyst virtual course</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></description>
										<content:encoded><![CDATA[<div class="fusion-fullwidth fullwidth-box fusion-builder-row-5 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling" style="--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;" ><div class="fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap" style="max-width:calc( 1200px + 0px );margin-left: calc(-0px / 2 );margin-right: calc(-0px / 2 );"><div class="fusion-layout-column fusion_builder_column fusion-builder-column-12 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:0px;--awb-margin-bottom-large:20px;--awb-spacing-left-large:0px;--awb-width-medium:100%;--awb-spacing-right-medium:0px;--awb-spacing-left-medium:0px;--awb-width-small:100%;--awb-spacing-right-small:0px;--awb-spacing-left-small:0px;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-7"><p style="letter-spacing: 3px;"><span style="color: #00adee;">INTRODUCTION</span></p>
</div><div class="fusion-text fusion-text-8"><p>What benefits can your organization achieve with Big Data? How to spot patterns in large quantities of enterprise data to improve performance and find underlying pattern that can benefit the organization? How can enterprises obtain a competitive advantage with Big Data? The <a href="https://www.cybiant.com/training/enterprise-big-data-analyst/">Enterprise Big Data Analyst (EBDA) course</a> discusses the theoretical concepts of Big Data and subsequently applies this knowledge to a real-world case study.</p>
<p>The Big Data Framework provides a holistic and compressive approach for enterprises that aim to leverage the value of data in their organizations. The framework covers all the essential aspects of Big Data that are necessary to understand and analyse massive quantities of data. The Enterprise Big Data Analyst course is the second level of the <strong><a href="https://www.bigdataframework.org/">Big Data Framework</a></strong> course curriculum and certification program, that is globally recognized and accredited by <strong>APMG-International</strong>. The curriculum provides a vendor-neutral and objective understanding of Big Data architectures, technologies and processes.</p>
<p>The Enterprise Big Data Analyst qualification is a practitioner course for all professionals that aim to an in-depth understanding of Big Data analysis techniques and models, core data analysis processes steps, and best practices to retrieve value from data.</p>
</div><div class="fusion-text fusion-text-9"><p style="letter-spacing: 3px;"><span style="color: #00adee;">BENEFITS</span></p>
</div><div class="fusion-text fusion-text-10"><p>In this course, you will learn the following:</p>
<ul>
<li>Understand and explain the data <strong>analysis process</strong>, including all relevant steps included in enterprise big data analysis.</li>
<li>Understand the difference and structure of common <strong>data sources</strong> (local, online and database connections) and the way these sources should be imported in order to perform data analysis.</li>
<li>Apply and utilize fundamental <strong>data cleaning</strong> operations and the differences between different data cleaning techniques.</li>
<li>Apply and utilize fundamental <strong>data wrangling</strong> operations and the differences between different data wrangling techniques.</li>
<li>Understand and apply exploratory data <strong>analysis techniques</strong> that are required for model building, model validation and initial visualizations.</li>
<li>Understand and apply the core concepts of<strong> statistical inference</strong>, including techniques required for hypothesis testing.</li>
<li>Formulate and interpret <strong>predictive models</strong> based on statistical correlation and regression functions, including simple linear regression.</li>
<li>Formulate and interpret <strong>machine learning</strong> models for classification, including K-Nearest Neighbour, Naïve Bayes, Logistic Regression and Classification Trees.</li>
<li>Formulate and interpret machine learning models for <strong>clustering</strong>, including the Hierarchical clustering and K-means clustering techniques.</li>
<li>Formulate and interpret outlier <strong>detection models</strong>, including Grubbs Outlier detection and K-NN Outlier Detection.</li>
</ul>
</div><div class="fusion-text fusion-text-11"><p style="letter-spacing: 3px;"><span style="color: #00adee;">ABOUT THE TRAINER</span></p>
</div><div class="fusion-text fusion-text-12"><p><img decoding="async" class="size-thumbnail wp-image-3325 alignright" src="https://www.cybiant.com/wp-content/uploads/2019/09/Untitled-design-6-150x150.png" alt="Jan-Willem Middelburg, Author of the EBDA" width="150" height="150" srcset="https://www.cybiant.com/wp-content/uploads/2019/09/Untitled-design-6-66x66.png 66w, https://www.cybiant.com/wp-content/uploads/2019/09/Untitled-design-6-100x100.png 100w, https://www.cybiant.com/wp-content/uploads/2019/09/Untitled-design-6-150x150.png 150w" sizes="(max-width: 150px) 100vw, 150px" />Jan Willem Middelburg from the Netherlands has been in the information technology industry for the past 10 years and is the author and Chief Examiner of the Enterprise Big Data Framework.  Jan-Willem began his writing career in 2013 as one of the co-authors of the book <em>Serious Gaming</em>. Subsequent publications included the <em>Service Automation Framework</em> and the <em>Enterprise Big Data Framework. </em>Both publication have been translated into Chinese, and now form the basis for global education and certification schemes.</p>
</div><div class="fusion-text fusion-text-13"><p style="letter-spacing: 3px;"><span style="color: #00adee;">ACCREDITATION</span></p>
</div><div class="fusion-text fusion-text-14"><p>Cybiant is an accredited training provider with <strong>APMG International</strong> and one of the first to deliver the Enterprise Big Data Analyst course. Upon successful completion of the course, participants will receive the official Enterprise Big Data Analyst certification and digital badge. APMG International is a global well-established accreditation and certification body who is accredited by the United Kingdom Accreditation Service (UKAS) for ISO/ IEC17065: 2012 and ISO/ IEC17024: 2012.</p>
</div><div class="fusion-text fusion-text-15"><p style="letter-spacing: 3px;"><span style="color: #00adee;">COURSE &amp; DATE</span></p>
</div><div class="fusion-text fusion-text-16"><p>In conjunction with launching the certification, first learning class of Enterprise Big Data Analyst will be solely train by the framework author with live delivery in Asia. The following is the joining instructions for you to be the world first batch certified Enterprise Big Data Analyst:</p>
<ul>
<li><img decoding="async" class="size-thumbnail wp-image-5257 alignright" src="https://www.cybiant.com/wp-content/uploads/2019/11/Big-data-EBDA-September-2019-@600PX-150x150.png" alt="Enterprise Big Data Analyst" width="150" height="150" srcset="https://www.cybiant.com/wp-content/uploads/2019/11/Big-data-EBDA-September-2019-@600PX-66x66.png 66w, https://www.cybiant.com/wp-content/uploads/2019/11/Big-data-EBDA-September-2019-@600PX-100x100.png 100w, https://www.cybiant.com/wp-content/uploads/2019/11/Big-data-EBDA-September-2019-@600PX-150x150.png 150w, https://www.cybiant.com/wp-content/uploads/2019/11/Big-data-EBDA-September-2019-@600PX-200x200.png 200w, https://www.cybiant.com/wp-content/uploads/2019/11/Big-data-EBDA-September-2019-@600PX-300x300.png 300w, https://www.cybiant.com/wp-content/uploads/2019/11/Big-data-EBDA-September-2019-@600PX-400x400.png 400w, https://www.cybiant.com/wp-content/uploads/2019/11/Big-data-EBDA-September-2019-@600PX-500x500.png 500w, https://www.cybiant.com/wp-content/uploads/2019/11/Big-data-EBDA-September-2019-@600PX.png 600w" sizes="(max-width: 150px) 100vw, 150px" />Certification Name: Enterprise Big Data Analyst Course &amp; Exam</li>
<li>First Course Date: 11-15 May 2020/ 18-22 May 2020</li>
<li>Course Format: Virtual Learning</li>
<li>Learning Hour: Daily 9am-11am &amp; 1-3pm (GMT+8)</li>
<li>Pre-requisite: Enterprise Big Data Professional Holder</li>
<li>Investment Level: MYR 5200| SGD 2500| USD 1800</li>
</ul>
</div><div class="fusion-image-element " style="text-align:center;--awb-liftup-border-radius:0px;--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);"><div class="awb-image-frame awb-image-frame-2 imageframe-liftup"><span class=" fusion-imageframe imageframe-none imageframe-2"><img decoding="async" width="300" height="151" alt="Cybiant Virtual Classroom | Cybiant Virtual Learning" title="Virtual Learning" src="https://www.cybiant.com/wp-content/uploads/2020/04/Virtual-Learning-300x151.png" class="img-responsive wp-image-10326" srcset="https://www.cybiant.com/wp-content/uploads/2020/04/Virtual-Learning-200x101.png 200w, https://www.cybiant.com/wp-content/uploads/2020/04/Virtual-Learning-400x202.png 400w, https://www.cybiant.com/wp-content/uploads/2020/04/Virtual-Learning-600x303.png 600w, https://www.cybiant.com/wp-content/uploads/2020/04/Virtual-Learning-800x404.png 800w, https://www.cybiant.com/wp-content/uploads/2020/04/Virtual-Learning-1200x606.png 1200w, https://www.cybiant.com/wp-content/uploads/2020/04/Virtual-Learning.png 1294w" sizes="(max-width: 700px) 100vw, 1200px" /></span></div></div><div class="fusion-separator fusion-full-width-sep" style="align-self: center;margin-left: auto;margin-right: auto;margin-top:20px;width:100%;"></div><div style="text-align:center;"><a class="fusion-button button-3d fusion-button-default-size button-default fusion-button-default button-1 fusion-button-span-yes " target="_self" href="mailto:info@cybiant.com"><i class="fa-envelope fas awb-button__icon awb-button__icon--default button-icon-left" aria-hidden="true"></i><span class="fusion-button-text awb-button__text awb-button__text--default">More information</span></a></div></div></div></div></div>
<p>The post <a href="https://www.cybiant.com/the-first-enterprise-big-data-analyst-virtual-course/">Cybiant to host the first Enterprise Big Data Analyst virtual course</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.cybiant.com/the-first-enterprise-big-data-analyst-virtual-course/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>The Next Level of the Enterprise Big Data Framework now available</title>
		<link>https://www.cybiant.com/the-next-level-of-the-enterprise-big-data-framework-now-available/</link>
					<comments>https://www.cybiant.com/the-next-level-of-the-enterprise-big-data-framework-now-available/#respond</comments>
		
		<dc:creator><![CDATA[Cybiant]]></dc:creator>
		<pubDate>Fri, 17 Apr 2020 04:28:06 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[enterprise big data framework]]></category>
		<guid isPermaLink="false">https://www.cybiant.com/?p=10466</guid>

					<description><![CDATA[<p>Great news for the Enterprise Big Data Professional certified professionals!  The wait is over, the second level of the Big Data Framework is finally here! The Enterprise Big Data Analyst qualification builds upon the first level of the Big Data Framework qualification scheme Enterprise Big Data Professional® in which fundamental knowledge and elementary  [...]</p>
<p>The post <a href="https://www.cybiant.com/the-next-level-of-the-enterprise-big-data-framework-now-available/">The Next Level of the Enterprise Big Data Framework now available</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p><div class="fusion-fullwidth fullwidth-box fusion-builder-row-6 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling" style="--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;" ><div class="fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap" style="max-width:calc( 1200px + 0px );margin-left: calc(-0px / 2 );margin-right: calc(-0px / 2 );"><div class="fusion-layout-column fusion_builder_column fusion-builder-column-13 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:0px;--awb-margin-bottom-large:20px;--awb-spacing-left-large:0px;--awb-width-medium:100%;--awb-spacing-right-medium:0px;--awb-spacing-left-medium:0px;--awb-width-small:100%;--awb-spacing-right-small:0px;--awb-spacing-left-small:0px;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-title title fusion-title-2 fusion-sep-none fusion-title-text fusion-title-size-two" style="--awb-margin-top-small:10px;--awb-margin-right-small:0px;--awb-margin-bottom-small:10px;--awb-margin-left-small:0px;"><h2 class="fusion-title-heading title-heading-left" style="margin:0;"><strong>Great news for the Enterprise Big Data Professional certified professionals!</strong></h2></div><div class="fusion-text fusion-text-17" style="--awb-text-transform:none;"><h4 style="text-align: justify;">The wait is over, the second level of the <strong>Big Data Framework</strong> is finally here! The <strong>Enterprise Big Data Analyst</strong> qualification builds upon the first level of the Big Data Framework qualification scheme <strong><em>Enterprise Big Data Professional</em></strong>® in which fundamental knowledge and elementary concepts related to Big Data were covered.</h4>
</div><div class="fusion-text fusion-text-18" style="--awb-text-transform:none;"><p style="text-align: justify;">With the EBDA® certification, you can improve your credibility and knowledge in Big Data and take it to the next level. The <a href="https://www.cybiant.com/training/enterprise-big-data-analyst/">Enterprise Big Data Analyst</a> is a practitioner course for all professionals that aim to have an in-depth understanding of Big Data analysis techniques and models, core data analysis processes steps, and best practices to retrieve value from data.</p>
<p style="text-align: justify;">Recently, many organizations have started to use Big Data solutions to forecast and understand the impact of the pandemic Covid-19. And many businesses are turning to Big Data solutions to make critical decisions. The need for more Big Data professionals is evident.</p>
<p style="text-align: justify;">The Enterprise Big Data Analyst (EBDA®) qualification shows that candidates possess the skills to analyze Big Data and are able to understand key data analysis concepts, R Programming language and techniques to correctly interpret data to make better decisions for companies or governments.</p>
<p style="text-align: justify;">The Enterprise Big Data Analyst (EBDA®) certification, accredited by <strong><em>APMG international</em></strong> (a global well-established accreditation and certification body), will be awarded to candidates who have passed the exam. <em><strong>Cybiant is the first Accredited Training Organization (ATO) to deliver the Enterprise Big Data Analyst course and certification in Asia.</strong></em></p>
<h4 style="text-align: justify;"></h4>
<h4 style="text-align: justify;"></h4>
</div><div class="fusion-text fusion-text-19"><p><strong><u>Course Date &amp; Fee</u></strong></p>
</div><ul style="--awb-iconcolor:#03a9f4;--awb-line-height:27.2px;--awb-icon-width:27.2px;--awb-icon-height:27.2px;--awb-icon-margin:11.2px;--awb-content-margin:38.4px;--awb-circlecolor:#ffffff;--awb-circle-yes-font-size:14.08px;" class="fusion-checklist fusion-checklist-3 fusion-checklist-default type-icons"><li class="fusion-li-item" style=""><span class="icon-wrapper circle-yes"><i class="fusion-li-icon fa-chevron-circle-right fas" aria-hidden="true"></i></span><div class="fusion-li-item-content">
<p>Certification Name: Enterprise Big Data Analyst Course &amp; Exam</p>
</div></li><li class="fusion-li-item" style=""><span class="icon-wrapper circle-yes"><i class="fusion-li-icon fa-chevron-circle-right fas" aria-hidden="true"></i></span><div class="fusion-li-item-content">
<p>Date: 11-15 May 2020</p>
</div></li><li class="fusion-li-item" style=""><span class="icon-wrapper circle-yes"><i class="fusion-li-icon fa-chevron-circle-right fas" aria-hidden="true"></i></span><div class="fusion-li-item-content">
<p>Learning Hours: Daily 9am-11am &amp; 1-3pm (GMT+8)</p>
</div></li><li class="fusion-li-item" style=""><span class="icon-wrapper circle-yes"><i class="fusion-li-icon fa-chevron-circle-right fas" aria-hidden="true"></i></span><div class="fusion-li-item-content">
<p>Pre-requisite: Enterprise Big Data Professional Holder</p>
</div></li><li class="fusion-li-item" style=""><span class="icon-wrapper circle-yes"><i class="fusion-li-icon fa-chevron-circle-right fas" aria-hidden="true"></i></span><div class="fusion-li-item-content">
<p>Promotion Price: MYR 5200 | SGD 2500| USD 1800 (including examination)</p>
</div></li></ul><div class="fusion-separator fusion-full-width-sep" style="align-self: center;margin-left: auto;margin-right: auto;margin-top:25px;width:100%;"></div><div class="fusion-text fusion-text-20"><p><strong><u>Trainer Bio</u></strong></p>
</div><div class="fusion-text fusion-text-21" style="--awb-text-transform:none;"><p style="text-align: justify;"><img decoding="async" class="alignright wp-image-3325 size-full" src="https://www.cybiant.com/wp-content/uploads/2019/09/Untitled-design-6.png" alt="Jan-Willem Middelburg, Author of the EBDA" width="238" height="163" srcset="https://www.cybiant.com/wp-content/uploads/2019/09/Untitled-design-6-200x137.png 200w, https://www.cybiant.com/wp-content/uploads/2019/09/Untitled-design-6.png 238w" sizes="(max-width: 238px) 100vw, 238px" />Jan Willem Middelburg is the author and chief examiner of the Big Data Framework. He has been in the information technology and education industries for the past 10 years and known for his experience in ITSM framework as the certified ITIL Expert practitioner. He had written numerous books such as the “Service Automation Framework”, “Big Data Framework” and also co-authored the book “Serious Gaming”.</p>
</div></div></div></div></div><div class="fusion-fullwidth fullwidth-box fusion-builder-row-7 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling" style="--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;" ><div class="fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap" style="max-width:calc( 1200px + 0px );margin-left: calc(-0px / 2 );margin-right: calc(-0px / 2 );"><div class="fusion-layout-column fusion_builder_column fusion-builder-column-14 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:0px;--awb-margin-bottom-large:20px;--awb-spacing-left-large:0px;--awb-width-medium:100%;--awb-spacing-right-medium:0px;--awb-spacing-left-medium:0px;--awb-width-small:100%;--awb-spacing-right-small:0px;--awb-spacing-left-small:0px;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-22"><p><strong><u>Register Form</u></strong></p>
</div><div class="fusion-text fusion-text-23"><p>[wpforms id=&#8221;10475&#8243;]</p>
</div></div></div></div></div></p>
<p>The post <a href="https://www.cybiant.com/the-next-level-of-the-enterprise-big-data-framework-now-available/">The Next Level of the Enterprise Big Data Framework now available</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.cybiant.com/the-next-level-of-the-enterprise-big-data-framework-now-available/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Decision Tree: A powerful and intuitive process to predict churn</title>
		<link>https://www.cybiant.com/decision-tree-a-powerful-and-intuitive-process-to-predict-churn/</link>
					<comments>https://www.cybiant.com/decision-tree-a-powerful-and-intuitive-process-to-predict-churn/#respond</comments>
		
		<dc:creator><![CDATA[Cybiant]]></dc:creator>
		<pubDate>Wed, 15 Apr 2020 05:28:47 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[White Paper]]></category>
		<guid isPermaLink="false">https://www.cybiant.com/?p=10429</guid>

					<description><![CDATA[<p>Download full case study   Decision Tree: A powerful and intuitive process to predict churn (Case Study) There are many techniques available for the analysis of large-scale data. Churn-data of Telco companies can be analysed in various ways. Some focus on understanding the key characteristics of customers who will end their contract in  [...]</p>
<p>The post <a href="https://www.cybiant.com/decision-tree-a-powerful-and-intuitive-process-to-predict-churn/">Decision Tree: A powerful and intuitive process to predict churn</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></description>
										<content:encoded><![CDATA[<div class="fusion-fullwidth fullwidth-box fusion-builder-row-8 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling" style="--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;" ><div class="fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap" style="max-width:calc( 1200px + 0px );margin-left: calc(-0px / 2 );margin-right: calc(-0px / 2 );"><div class="fusion-layout-column fusion_builder_column fusion-builder-column-15 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:0px;--awb-margin-bottom-large:20px;--awb-spacing-left-large:0px;--awb-width-medium:100%;--awb-spacing-right-medium:0px;--awb-spacing-left-medium:0px;--awb-width-small:100%;--awb-spacing-right-small:0px;--awb-spacing-left-small:0px;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-image-element " style="--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);"><span class=" fusion-imageframe imageframe-none imageframe-3 hover-type-none"><img decoding="async" width="1000" height="500" title="BLOG Cybiant (10)" src="https://www.cybiant.com/wp-content/uploads/2020/04/BLOG-Cybiant-10.jpg" alt class="img-responsive wp-image-10431" srcset="https://www.cybiant.com/wp-content/uploads/2020/04/BLOG-Cybiant-10-200x100.jpg 200w, https://www.cybiant.com/wp-content/uploads/2020/04/BLOG-Cybiant-10-400x200.jpg 400w, https://www.cybiant.com/wp-content/uploads/2020/04/BLOG-Cybiant-10-600x300.jpg 600w, https://www.cybiant.com/wp-content/uploads/2020/04/BLOG-Cybiant-10-800x400.jpg 800w, https://www.cybiant.com/wp-content/uploads/2020/04/BLOG-Cybiant-10.jpg 1000w" sizes="(max-width: 700px) 100vw, 1000px" /></span></div><div class="fusion-separator fusion-full-width-sep" style="align-self: center;margin-left: auto;margin-right: auto;margin-top:20px;width:100%;"></div><div style="text-align:center;"><a class="fusion-button button-3d fusion-button-default-size button-default fusion-button-default button-2 fusion-button-default-span " target="_self" href="https://www.cybiant.com/download/10399/"><span class="fusion-button-text awb-button__text awb-button__text--default">Download full case study </span></a></div><div class="fusion-separator fusion-full-width-sep" style="align-self: center;margin-left: auto;margin-right: auto;margin-top:20px;width:100%;"></div><div class="fusion-text fusion-text-24" style="--awb-text-transform:none;"><h2 style="text-align: justify;">Decision Tree: A powerful and intuitive process to predict churn (Case Study)</h2>
<p style="text-align: justify;">There are many techniques available for the analysis of large-scale data. Churn-data of Telco companies can be analysed in various ways. Some focus on understanding the key characteristics of customers who will end their contract in the near future. Others concentrate more on predicting as precisely as possible which customers are expected to churn. The drawback of the second group of methods is that they are often very complex and not easy to follow for people without a data-science background. We believe decision trees fit perfectly in between these two groups. It gives possible to predict churn for large-scale data, with many different attributes. Furthermore, it can be visualized as a tree, which displays very clearly how the is built and how it is used to make predictions.</p>
<p style="text-align: justify;">An example of a decision tree for a sample dataset of a Telco company is displayed below. The target variable of this tree is customer churn. The tree can be seen as a kind of flow-chart, you start at the top and follow some questions to end up in a certain bucket at the bottom. The tree could be followed manually, such that each customer could be placed in one of the buckets. The splits in the tree are questions about the values of each attribute of the customer. All customers that end up in a bucket have certain characteristics in common. In this way, we can assign a probability of churn to each bucket.</p>
<p style="text-align: justify;">The most complex part of the decision tree is the algorithm which is used to generate the tree, although the main idea is quite intuitive. The goal is to make each bucket as pure as possible. In a perfectly pure bucket, all customers would churn or all customers would not. We start with the whole dataset. Then the algorithm strives to find the attribute to split on, which makes the two subsets the purest as possible. After selecting the first split, the same process is repeated for the part of the data one step lower in the tree, and the same for the next step.</p>
</div><div class="fusion-reading-box-container reading-box-container-1" style="--awb-title-color:var(--awb-color8);--awb-margin-top:0px;--awb-margin-bottom:20px;"><div class="reading-box" style="background-color:var(--awb-color1);border-width:0px;border-color:#03a9f4;border-left-width:3px;border-left-color:var(--primary_color);border-style:solid;"><div class="reading-box-description">This model reaches an accuracy of 79.3%. This means that it predicts
8 out of 10 times correctly whether a customer is likely to churn or not.
This information will be highly valuable for a Telco company.</div><div class="fusion-clearfix"></div></div></div><div class="fusion-text fusion-text-25" style="--awb-text-transform:none;"><p style="text-align: justify;">When the model is created the most difficult part of the work is already done. We used data from the past to build the tree. Now we can apply all that is learned from historical data to new data of different customers. It enables us to go through the tree for each customer and predict whether they are likely to end their contract in the near future or not. Of course, we are not going to follow the tree by hand but let the computers do the work. This model could be applied to an unlimited quantity of customers, who would all end up in one of the buckets.</p>
<p style="text-align: justify;">It is important to keep in mind that we are working with a model. The goal is to predict as correctly as possible which customers are going to end their contract. We can never achieve a 100% accuracy on the model. A crucial aspect of the tree for the accuracy of prediction is how many nodes it includes. Of course, the tree could be split such that there is a separate bucket for practically every customer. This would give nearly 100% accuracy if one runs it over the same data which is used for building the tree. For new data, this would not be the correct approach. A tree that has too little nodes is not specific enough. This results in a model that performs poorly, both on the old and the new data. The trick is to find a sweet spot in between those ends.</p>
<p style="text-align: justify;">We obtained a maximum accuracy of 79% for predicting churn on a test dataset. In other words, the model predicts in almost eight out of ten cases correctly whether the customer is going to end his/her contract soon or not. This information shows Telco companies which customers who are likely to end their contract. They could use these insights to target customers in order to retain them for a longer period.</p>
</div><div class="fusion-image-element " style="--awb-caption-title-font-family:var(--h2_typography-font-family);--awb-caption-title-font-weight:var(--h2_typography-font-weight);--awb-caption-title-font-style:var(--h2_typography-font-style);--awb-caption-title-size:var(--h2_typography-font-size);--awb-caption-title-transform:var(--h2_typography-text-transform);--awb-caption-title-line-height:var(--h2_typography-line-height);--awb-caption-title-letter-spacing:var(--h2_typography-letter-spacing);"><span class=" fusion-imageframe imageframe-none imageframe-4 hover-type-none"><img decoding="async" width="1840" height="960" title="figure 5" src="https://www.cybiant.com/wp-content/uploads/2020/04/figure-5-new.png" alt class="img-responsive wp-image-10408" srcset="https://www.cybiant.com/wp-content/uploads/2020/04/figure-5-new-200x104.png 200w, https://www.cybiant.com/wp-content/uploads/2020/04/figure-5-new-400x209.png 400w, https://www.cybiant.com/wp-content/uploads/2020/04/figure-5-new-600x313.png 600w, https://www.cybiant.com/wp-content/uploads/2020/04/figure-5-new-800x417.png 800w, https://www.cybiant.com/wp-content/uploads/2020/04/figure-5-new-1200x626.png 1200w, https://www.cybiant.com/wp-content/uploads/2020/04/figure-5-new.png 1840w" sizes="(max-width: 700px) 100vw, 1200px" /></span></div><div class="fusion-separator fusion-full-width-sep" style="align-self: center;margin-left: auto;margin-right: auto;margin-top:20px;width:100%;"></div><div style="text-align:center;"><a class="fusion-button button-3d fusion-button-default-size button-default fusion-button-default button-3 fusion-button-default-span " target="_self" href="https://www.cybiant.com/download/10399/"><span class="fusion-button-text awb-button__text awb-button__text--default">Download full case study </span></a></div></div></div></div></div>
<p>The post <a href="https://www.cybiant.com/decision-tree-a-powerful-and-intuitive-process-to-predict-churn/">Decision Tree: A powerful and intuitive process to predict churn</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.cybiant.com/decision-tree-a-powerful-and-intuitive-process-to-predict-churn/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>New Case Study: Predicting Customer Churn in Power BI</title>
		<link>https://www.cybiant.com/new-case-study-predicting-customer-churn-in-power-bi/</link>
					<comments>https://www.cybiant.com/new-case-study-predicting-customer-churn-in-power-bi/#respond</comments>
		
		<dc:creator><![CDATA[Jan-Willem Middelburg]]></dc:creator>
		<pubDate>Sat, 28 Mar 2020 09:33:20 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Power BI]]></category>
		<category><![CDATA[Prediction]]></category>
		<guid isPermaLink="false">https://www.cybiant.com/?p=10300</guid>

					<description><![CDATA[<p>Introduction  Modern Telco companies gather lots of data on their customers. Varying from their gender to their monthly payments to the type of contract they have. One aspect that is especially important to keep track of is the group of customers that ends or will end their contract soon. Cybiant can help  [...]</p>
<p>The post <a href="https://www.cybiant.com/new-case-study-predicting-customer-churn-in-power-bi/">New Case Study: Predicting Customer Churn in Power BI</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></description>
										<content:encoded><![CDATA[<div class="fusion-fullwidth fullwidth-box fusion-builder-row-9 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling" style="--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;" ><div class="fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap" style="max-width:1248px;margin-left: calc(-4% / 2 );margin-right: calc(-4% / 2 );"><div class="fusion-layout-column fusion_builder_column fusion-builder-column-16 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:1.92%;--awb-margin-bottom-large:0px;--awb-spacing-left-large:1.92%;--awb-width-medium:100%;--awb-spacing-right-medium:1.92%;--awb-spacing-left-medium:1.92%;--awb-width-small:100%;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-flex-justify-content-flex-start fusion-content-layout-column"></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-17 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:1.92%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:1.92%;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:1.92%;--awb-spacing-left-medium:1.92%;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-26" style="--awb-font-size:25px;--awb-text-transform:none;--awb-text-font-family:&quot;Poppins&quot;;--awb-text-font-style:normal;--awb-text-font-weight:700;"><p style="text-align: justify;">Introduction</p>
</div><div class="fusion-text fusion-text-27" style="--awb-text-transform:none;"><p style="text-align: justify;">Modern Telco companies gather lots of data on their customers. Varying from their gender to their monthly payments to the type of contract they have. One aspect that is especially important to keep track of is the group of customers that ends or will end their contract soon. Cybiant can help Telco companies to obtain valuable insights on how to identify customers that are likely to end their contract and to find which factors are indicating such behaviour.  There are two ways this dataset can be used to study contract ends. The first is determining which factors are typical of customers who end their contract, the second is identifying which customers are likely to end their contract. The focus of this article is on the former.  Therefore we built a showcase model with the powerful combination of Power BI and R.</p>
<p style="text-align: justify;">We use a sample dataset of the customer-base of a telco company for our analysis. The variable of interest is ‘churn’, which has two values: ”yes” if a customer ended their contract in the last month, and “no” otherwise. There are various different characteristics of customers in the dataset which can be divided in three main categories: services that each customer signed up for, customer account information and demographic information about each customer.</p>
<div id="attachment_10105" style="width: 610px" class="wp-caption aligncenter"><img decoding="async" aria-describedby="caption-attachment-10105" class="wp-image-10105 size-fusion-600" src="https://www.cybiant.com/wp-content/uploads/2020/03/PowerBI-Customer-Churn-600x337.png" alt="Calculating Customer Churn in PowerBI" width="600" height="337" srcset="https://www.cybiant.com/wp-content/uploads/2020/03/PowerBI-Customer-Churn-200x112.png 200w, https://www.cybiant.com/wp-content/uploads/2020/03/PowerBI-Customer-Churn-300x168.png 300w, https://www.cybiant.com/wp-content/uploads/2020/03/PowerBI-Customer-Churn-400x224.png 400w, https://www.cybiant.com/wp-content/uploads/2020/03/PowerBI-Customer-Churn-500x280.png 500w, https://www.cybiant.com/wp-content/uploads/2020/03/PowerBI-Customer-Churn-600x337.png 600w, https://www.cybiant.com/wp-content/uploads/2020/03/PowerBI-Customer-Churn-700x393.png 700w, https://www.cybiant.com/wp-content/uploads/2020/03/PowerBI-Customer-Churn-768x431.png 768w, https://www.cybiant.com/wp-content/uploads/2020/03/PowerBI-Customer-Churn-800x449.png 800w, https://www.cybiant.com/wp-content/uploads/2020/03/PowerBI-Customer-Churn.png 943w" sizes="(max-width: 600px) 100vw, 600px" /><p id="caption-attachment-10105" class="wp-caption-text">Figure 1: Summarizing Customer Data in Power BI</p></div>
<p style="text-align: justify;">After loading and cleaning the data in Power BI. We add a measure which is called the ‘churn-rate’, which is the percentage of the customers that ended their contract in the last month. The data can be visualized in various ways for different purposes. Some example of insightful visualizations are shown in figure 1.  We can observe the churn-rate of all customers in the top left. At the bottom is shown that the churn rate Fiber optic users is a lot higher that of DSL users or users which have no internet connection at all. This last group could have a lower churn rate, since it is more difficult to end you contract without an internet connection. The chart to the right demonstrates that the customers using Fiber-optic account for the largest part of total monthly income for the Telco company. This means that the largest group in income is also the group which has the highest churn rate. This seems to be a problem the company cannot neglect. At the pie chart becomes clear that the Fiber Optic customers are only 44% of the total number of customers, but they account 63% of monthly charges. It must be that they pay more monthly charges than an average customer.</p>
</div></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-18 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:1.92%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:1.92%;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:1.92%;--awb-spacing-left-medium:1.92%;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-28" style="--awb-font-size:25px;--awb-text-transform:none;--awb-text-font-family:&quot;Poppins&quot;;--awb-text-font-style:normal;--awb-text-font-weight:700;"><p style="text-align: justify;">Attribute Selection: which variables indicate customer churn?</p>
</div><div class="fusion-text fusion-text-29" style="--awb-text-transform:none;"><p style="text-align: justify;">The next step is to determine which attributes are the most important in predicting which customers are going to end their contract. Nowadays businesses have access to such large amounts of data and so many different attributes of information, that it is difficult to determine where to start. Figure 2 shows the importance of each attribute for predicting whether a customer will churn or not. The type of contract is the most important attribute as shown in the figure above. This figure provides a direction to start looking for more information. It makes clear that the contract attribute has the highest predictive value for churn.  Not too much time should be spent on analysing the differences between genders since that attribute has no significant predictive value for churn. Figure 2 is just an example and can be easily extended to datasets with many more attributes.</p>
<p style="text-align: justify;">The used measure is called information gain(IG). We can split the data in different sets according to the categories of an attribute. The whole dataset has a certain purity. If all cases in the set have the same value for churn, the set is perfectly pure. In real-life big data perfect purity is rarely possible. , therefore we use a measure for the gain in purity when segmenting, which is IG. The set of customers can be divided into subsets according to the different values of an attribute. IG measures how much in purity is gained by segmenting on that particular attribute. A more in-depth explanation and the mathematical formulas can be found in the full article.</p>
<p style="text-align: justify;">Back to the Telco data. Figure 2 shows that the type of contract is the most important attribute. The bar chart in figure 1 shows us that people who have a more flexible month-to-month contract, are more likely to churn. The line in the bar chart informs us that the group of people with a flexible contract is responsible for the largest part of income.  The Telco company could use these insights in various ways. For instance try to move people from month-to-month to yearly contracts or to keep more in touch with clients in that group, to make sure that they will be less likely to churn.</p>
<div id="attachment_10107" style="width: 610px" class="wp-caption aligncenter"><img decoding="async" aria-describedby="caption-attachment-10107" class="wp-image-10107 size-fusion-600" src="https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes-600x412.png" alt="Churn-Variables-Power-BI" width="600" height="412" srcset="https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes-200x137.png 200w, https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes-300x206.png 300w, https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes-400x275.png 400w, https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes-474x324.png 474w, https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes-500x344.png 500w, https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes-600x412.png 600w, https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes-700x481.png 700w, https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes-768x528.png 768w, https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes-800x550.png 800w, https://www.cybiant.com/wp-content/uploads/2020/03/Churn-Attributes.png 885w" sizes="(max-width: 600px) 100vw, 600px" /><p id="caption-attachment-10107" class="wp-caption-text">Figure 2 &#8211; Attribute Selection for Predicting Customer Churn</p></div>
</div></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-19 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:1.92%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:1.92%;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:1.92%;--awb-spacing-left-medium:1.92%;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-30" style="--awb-font-size:25px;--awb-text-transform:none;--awb-text-font-family:&quot;Poppins&quot;;--awb-text-font-style:normal;--awb-text-font-weight:700;"><p style="text-align: justify;">Correlation of predictive variables</p>
</div><div class="fusion-text fusion-text-31" style="--awb-text-transform:none;"><p style="text-align: justify;">Thanks to the information gain values we know more about the coherence of the attributes with the churn variable. To see how the attributes relate to each other, we have created a plot of “correlation” between the attributes, which is shown in figure 3. The plot below shows how much the attributes are related to each other. The values differ from 1(perfect correlation) to 0(no correlation at all). It is not a correlation as one would normally see with numerical variables, but a different measure for categorical variables, which is called Cramer’s V. In the attribute importance chart can be observed that OnlineSecurity and TechSupport are both important attributes. In the chart above we can see that those are highly correlated. Therefore, it is plausible that one causes another or the two go almost always together. Then they can be viewed as a one combined phenomenon. Besides the value for predicting churn, this chart can be used to develop a better understanding of the behaviour of customers. It can be easily observed which attributes are interrelated.</p>
<p style="text-align: justify;">We used the powerful combination of Power-BI to make business-oriented insightful visualizations on the data. Power BI is used to gather everything in one place, and make visual reports. R provides us with the flexibility to perform more custom-made operations on the data, for the needs of the specific company. The next step is to identify customers who are likely to churn beforehand. In our view both uses of the data described above are essential. And the predictive part would not be complete without understanding the data.</p>
<div id="attachment_10124" style="width: 610px" class="wp-caption aligncenter"><img decoding="async" aria-describedby="caption-attachment-10124" class="wp-image-10124 size-fusion-600" src="https://www.cybiant.com/wp-content/uploads/2020/03/Correlation-between-attributes-600x531.png" alt="Correlation between attributes" width="600" height="531" srcset="https://www.cybiant.com/wp-content/uploads/2020/03/Correlation-between-attributes-200x177.png 200w, https://www.cybiant.com/wp-content/uploads/2020/03/Correlation-between-attributes-300x266.png 300w, https://www.cybiant.com/wp-content/uploads/2020/03/Correlation-between-attributes-400x354.png 400w, https://www.cybiant.com/wp-content/uploads/2020/03/Correlation-between-attributes-500x443.png 500w, https://www.cybiant.com/wp-content/uploads/2020/03/Correlation-between-attributes-600x531.png 600w, https://www.cybiant.com/wp-content/uploads/2020/03/Correlation-between-attributes-700x620.png 700w, https://www.cybiant.com/wp-content/uploads/2020/03/Correlation-between-attributes.png 722w" sizes="(max-width: 600px) 100vw, 600px" /><p id="caption-attachment-10124" class="wp-caption-text">Figure 3 &#8211; Correlation between attributes</p></div>
</div></div></div><div class="fusion-layout-column fusion_builder_column fusion-builder-column-20 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:1.92%;--awb-margin-bottom-large:20px;--awb-spacing-left-large:1.92%;--awb-width-medium:100%;--awb-order-medium:0;--awb-spacing-right-medium:1.92%;--awb-spacing-left-medium:1.92%;--awb-width-small:100%;--awb-order-small:0;--awb-spacing-right-small:1.92%;--awb-spacing-left-small:1.92%;"><div class="fusion-column-wrapper fusion-column-has-shadow fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-32" style="--awb-font-size:25px;--awb-text-transform:none;--awb-text-font-family:&quot;Poppins&quot;;--awb-text-font-style:normal;--awb-text-font-weight:700;"><p style="text-align: justify;">Read more or download the white paper</p>
</div><div class="fusion-text fusion-text-33" style="--awb-text-transform:none;"><p style="text-align: justify;">Want to learn more about how to predict customer churn, or want to dive into the formulas and underlying theoretical model? Our <a href="https://www.cybiant.com/portfolio-items/predicting-customer-churn-in-power-bi/">Predicting Customer Churn Case Study</a> provides an even more in-depth look at at the steps that were taken to generate these results.</p>
</div></div></div></div></div>
<p>The post <a href="https://www.cybiant.com/new-case-study-predicting-customer-churn-in-power-bi/">New Case Study: Predicting Customer Churn in Power BI</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.cybiant.com/new-case-study-predicting-customer-churn-in-power-bi/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>An Introduction to the Hadoop Ecosystem</title>
		<link>https://www.cybiant.com/introduction-to-the-hadoop-ecosystem/</link>
					<comments>https://www.cybiant.com/introduction-to-the-hadoop-ecosystem/#respond</comments>
		
		<dc:creator><![CDATA[Jan-Willem Middelburg]]></dc:creator>
		<pubDate>Wed, 25 Sep 2019 03:20:31 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[HDFS]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[YARN]]></category>
		<guid isPermaLink="false">https://www.cybiant.com/?p=4104</guid>

					<description><![CDATA[<p>An Introduction to the Hadoop Ecosystem Everyone who is learning more about Big Data will, within a short period of time, encounter the Hadoop software framework. When I first encountered it, I found the ecosystem (all with highly interesting names) very elaborate, yet at the same time confusing. The many components (or to stick  [...]</p>
<p>The post <a href="https://www.cybiant.com/introduction-to-the-hadoop-ecosystem/">An Introduction to the Hadoop Ecosystem</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></description>
										<content:encoded><![CDATA[<div class="fusion-fullwidth fullwidth-box fusion-builder-row-10 fusion-flex-container nonhundred-percent-fullwidth non-hundred-percent-height-scrolling" style="--awb-border-radius-top-left:0px;--awb-border-radius-top-right:0px;--awb-border-radius-bottom-right:0px;--awb-border-radius-bottom-left:0px;--awb-flex-wrap:wrap;" ><div class="fusion-builder-row fusion-row fusion-flex-align-items-flex-start fusion-flex-content-wrap" style="max-width:calc( 1200px + 0px );margin-left: calc(-0px / 2 );margin-right: calc(-0px / 2 );"><div class="fusion-layout-column fusion_builder_column fusion-builder-column-21 fusion_builder_column_1_1 1_1 fusion-flex-column" style="--awb-bg-size:cover;--awb-width-large:100%;--awb-margin-top-large:0px;--awb-spacing-right-large:0px;--awb-margin-bottom-large:0px;--awb-spacing-left-large:0px;--awb-width-medium:100%;--awb-spacing-right-medium:0px;--awb-spacing-left-medium:0px;--awb-width-small:100%;--awb-spacing-right-small:0px;--awb-spacing-left-small:0px;"><div class="fusion-column-wrapper fusion-flex-justify-content-flex-start fusion-content-layout-column"><div class="fusion-text fusion-text-34"><h1>An Introduction to the Hadoop Ecosystem</h1>
<p>Everyone who is learning more about Big Data will, within a short period of time, encounter the Hadoop software framework. When I first encountered it, I found the ecosystem (all with highly interesting names) very elaborate, yet at the same time confusing. The many components (or to stick to Apache terminology ‘related projects’) sometimes add to the confusion, because each of the ‘related project’ has different focus or orientation. In this article, I will therefore aim to provide a comprehensive overview of the Hadoop Ecosystem as I have encountered it.</p>
<h2>Background, History and Objectives of the Apache Hadoop Project</h2>
<p>Before we start to explore the Hadoop Ecosystem in further detail, it would be good to understand why Hadoop was created? As with all open-source inventions that scale quickly, Hadoop clearly solves a specific problem. So what is the problem that led to the creation of Hadoop?</p>
<p>Hadoop was originally created by Doug Cutting and Mike Cafarella in 2005, when they were working on the <a href="https://nutch.apache.org/">Nutch Search Engine</a> project. The Nutch Search Engine project is highly extendible and scalable web crawler, used the search and index web pages. In order to search and index the billions of pages that exist on the internet, vast computing resources are required. Instead of relying on (centralized) hardware to deliver high-availability, Cutting and Cafarella developed software (now known as Hadoop) that was able to distribute the workload across clusters of computers.</p>
<p>With the development of Hadoop, Cutting and Cafarella solved the following problems:</p>
<ol>
<li>Instead of using centralized hardware, Hadoop distributes the workload over a cluster (i.e. a network of computers) which consist of commodity hardware. Because commodity hardware can be used, Hadoop provides a <strong>cost-effective solution</strong> for companies that need to process Big Data.</li>
<li>Hadoop is designed to scale up (and down) from single servers to thousands of machines, making it a highly efficient for batch processing. In most enterprise organizations, high-processing workloads are only required for relatively small periods of time (i.e. not continuously). The <strong>scalability</strong> of Hadoop extremely suitable for batch processing, and consequently for users of Cloud Computing environments, which frequently work on a pay-for-use model.</li>
<li>Because Hadoop distributes workloads over large clusters of commodity hardware, the library has been designed to detect and handle failures at the application layer. If for whatever reason the nodes in the network fail, Hadoop has redundancy built-in that will still complete the processing job. This makes Hadoop <strong>more resilient</strong> or fault-tolerant compared to traditional (centralized) solutions.</li>
</ol>
<p>When you think about these benefits, it is not difficult to see why Hadoop has quickly grown to become the industry standard for distributed storage and processing. Add to these benefits that Hadoop is an open-source solution (i.e. non-proprietary), and you will understand why it has been embraced by almost all enterprise organizations.</p>
<p>And the name Hadoop? This remains one of the most simple, yet fascinating stories in computing. Hadoop was named after the yellow-stuffed toy elephant of the 2-year old son of Doug Cutting, who was just beginning to talk and called the elephant ‘Hadoop.’ In the words of Doug Cutting:</p>
<blockquote>
<p><em>“The name my kid gave a stuffed yellow elephant. Short, relatively easy to spell and pronounce, meaningless, and not used elsewhere: those are my naming criteria. Kids are good at generating such. Googol is a kid’s term.”</em></p>
</blockquote>
<h2>The Core Components of Hadoop: Common, HDFS, MapReduce and YARN.</h2>
<p>When we take a more in-depth look at the Hadoop ecosystem, we can see that Hadoop can be broken down into four main components. These four components (which are in fact independent software modules) together compose an Hadoop cluster. In other words, when people are talking about ‘Hadoop,’ the actually mean that they are using (at least) these 4 components:</p>
<ol>
<li><strong>Hadoop Common</strong> – Contains libraries and utilities need by all the other Hadoop Modules;</li>
<li><strong>Hadoop Distributed File System</strong> – A distributed file-system that stores data on commodity machines, providing high aggregate bandwidth across a cluster;</li>
<li><strong>Hadoop MapReduce</strong> &#8211; A programming model for large scale data processing;</li>
<li><strong>Hadoop YARN</strong> – A resource-management platform responsible for managing compute resources in clusters and using them for scheduling of users&#8217; applications.</li>
</ol>
<p>In the next paragraphs, we will provide an overview of these four main components.</p>
<div id="attachment_4105" style="width: 610px" class="wp-caption aligncenter"><img decoding="async" aria-describedby="caption-attachment-4105" class="wp-image-4105 size-600" src="https://www.cybiant.com/wp-content/uploads/2019/09/Hadoop-Ecosystem-600x450.jpg" alt="Hadoop Ecosystem | Introduction to the Hadoop Ecosystem" width="600" height="450" srcset="https://www.cybiant.com/wp-content/uploads/2019/09/Hadoop-Ecosystem-200x150.jpg 200w, https://www.cybiant.com/wp-content/uploads/2019/09/Hadoop-Ecosystem-300x225.jpg 300w, https://www.cybiant.com/wp-content/uploads/2019/09/Hadoop-Ecosystem-400x300.jpg 400w, https://www.cybiant.com/wp-content/uploads/2019/09/Hadoop-Ecosystem-500x375.jpg 500w, https://www.cybiant.com/wp-content/uploads/2019/09/Hadoop-Ecosystem-600x450.jpg 600w, https://www.cybiant.com/wp-content/uploads/2019/09/Hadoop-Ecosystem-700x525.jpg 700w, https://www.cybiant.com/wp-content/uploads/2019/09/Hadoop-Ecosystem-768x576.jpg 768w, https://www.cybiant.com/wp-content/uploads/2019/09/Hadoop-Ecosystem-800x600.jpg 800w, https://www.cybiant.com/wp-content/uploads/2019/09/Hadoop-Ecosystem.jpg 960w" sizes="(max-width: 600px) 100vw, 600px" /><p id="caption-attachment-4105" class="wp-caption-text">Overview of the Hadoop Core Components &#8211; from <a href="https://www.cybiant.com/training/certified-hadoop-professional/">Certified Hadoop Professional</a> Course</p></div>
<h3>Hadoop Common</h3>
<p>Hadoop is a Java-based solution, meaning it has been written in Java. Hadoop Common provides the tools (in Java) for a user’s computer so that it can read the data that is stored in a Hadoop file system (discussed next). No matter what type of operating system a user has (Windows, Linux, etc.), Hadoop Common ensures that data can be correctly interpreted by the machine.</p>
<h3>Hadoop Distributed File System (HDFS)</h3>
<p>The Hadoop Distributed File System (HDFS) is a filesystem designed for storing very large files with streaming data access patterns, running on clusters of commodity hardware.<a href="#_ftn1" name="_ftnref1">[1]</a> This means that Hadoop stores files that are typically many terabytes up to petabytes of data. The streaming nature of HDFS means that HDFS stores data under the assumption that it will need to be read multiple times and that the speed with which the data can be read is most important. Lastly, HDFS is designed to run on commodity hardware, which is inexpensive hardware that than be sourced from different vendors.</p>
<p>In order to achieve these properties, HDFS breaks data down into smaller ‘blocks,’ typically of 64MB. Because of the abstraction towards blocks, there is no requirement should be stored on the same disk. Instead, they can be stored anywhere. And that is exactly what HDFS does. It stores data on different locations in network (i.e. a cluster). For that reason, it is referred to a <em>distributed</em> file system.</p>
<p>Because the blocks are stored in a cluster, the questions of fault tolerance rises? What happens if one of the connections in the network fails? Does this means that the data becomes incomplete? To address this potential problem of <em>distributed</em> storage, HDFS stores multiple (typically three) redundant copies of each block in the network. If a block for whatever reason becomes unavailable, a copy can be read from an alternative location. Due to this useful property, HDFS is a very fault-tolerant or robust storage system.</p>
<h3>Hadoop MapReduce</h3>
<p>MapReduce is a processing technique and program model that enables distributed processing of large quantities of data, in parallel, on large clusters of commodity hardware. Similar in the way that HDFS stores blocks in a distributed manner, MapReduce processes data in a distributed manner. In other words, MapReduce uses processing power in local nodes within the cluster, instead of centralized processing.</p>
<p>In order to accomplish this, a processing query needs to be expressed as a MapReduce job. A MapReduce job work by breaking down the processing into two distinct phases: the ‘Map’ operation and the ‘Reduce’ operation. The Map operation and takes in a set of data and subsequently converts that data in a new data set, where individual emblements are broken down into key/value pairs.</p>
<p>The output of the Map function is processed by the MapReduce framework, before being sent to the Reduce operation. This processing is sort and groups the key-value pairs, a process that is also known as <em>shuffeling.</em> Shuffling is technically embedded in the Reduce operation. The Reduce operation subsequently processes the (shuffled) output data from the Map operation and converts this into a smaller set of key/value pairs. This is the end results, which is the output of the MapReduce operation.</p>
<p>In summary, we could say that the MapReduce executes in three stages:</p>
<ul>
<li><strong>Map stage</strong>− The goal of the map operation is to process the input data. Generally the input data is in the form of file or directory and is stored in the Hadoop file system (HDFS). The input file is passed to the mapper function line by line. The map operation processes the data and creates several small chunks of data.</li>
<li><strong>Shuffle stage </strong>– The goal of the shuffle operation is to order and sort key/value pairs so that they are ordered into the right sequence.</li>
<li><strong>Reduce stage</strong>−  The goal of the Reduce operation is to process the data that comes from the Map operation. After processing, it produces a new set of output, which will be stored in the HDFS.</li>
</ul>
<p>The main benefit of using MapReduce is that it is able to scale quickly over large networks of computing nodes, making the processing highly efficient and quick.</p>
<h3>Hadoop YARN</h3>
<p>Hadoop YARN (Yet Another Resource Negotiator) is responsible for allocating system resources to the various applications running in a Hadoop cluster and scheduling tasks to be executed on different cluster nodes. It was developed because in very large clusters (with more than 4000 nodes), the MapReduce system begins to hit scalability bottlenecks.</p>
<p>Hadoop YARN solves the scalability problem by introducing a <em>resource manager </em>that manages the use of resources across a cluster. The resource manager manages the responsibilities of the JobTracker, which on its own turn schedules jobs (which data is processed when) as well as task monitoring (which processing jobs have been completed). With the addition of YARN to Hadoop, the scalability of processing data with Hadoop becomes virtually endless.</p>
<h2>Other components of the Hadoop Ecosystem</h2>
<p>Besides the 4 core components of Hadoop (Common, HDFS, MapReduce and YARN), the Hadoop Ecosystem has greatly developed with other tools and solutions that completement the 4 main component. Some of the more popular solutions are Pig, Hive, HBase, ZooKeeper and Sqoop. Each of these components are discussed in separate articles in further detail.</p>
<p><a href="#_ftnref1" name="_ftn1">[1]</a> The architecture of HDFS is described in “The Hadoop Distributed File System” by Konstantin Shvachko, Hairong Kuang, Sanjay Radia, and Robert Chansler (Proceedings of MSST2010, May 2010, http:// storageconference.org/2010/Papers/MSST/Shvachko.pdf).</p>
</div><div ><a class="fusion-button button-flat fusion-button-default-size button-default fusion-button-default button-4 fusion-button-span-yes fusion-button-default-type" target="_self" href="https://www.cybiant.com/education/certified-hadoop-professional/"><span class="fusion-button-text awb-button__text awb-button__text--default">Read more about Certified Hadoop Professional Training</span></a></div></div></div></div></div>
<p>The post <a href="https://www.cybiant.com/introduction-to-the-hadoop-ecosystem/">An Introduction to the Hadoop Ecosystem</a> appeared first on <a href="https://www.cybiant.com">Cybiant</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.cybiant.com/introduction-to-the-hadoop-ecosystem/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
