<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>data science Archives - KDD Analytics</title>
	<atom:link href="https://www.kddanalytics.com/tag/data-science/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.kddanalytics.com/tag/data-science/</link>
	<description>Data to Decisions</description>
	<lastBuildDate>Sat, 24 Mar 2018 02:46:29 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9</generator>

<image>
	<url>https://i0.wp.com/www.kddanalytics.com/wp-content/uploads/2016/08/cropped-imageedit_1_7939659602.png?fit=32%2C32&#038;ssl=1</url>
	<title>data science Archives - KDD Analytics</title>
	<link>https://www.kddanalytics.com/tag/data-science/</link>
	<width>32</width>
	<height>32</height>
</image> 
<site xmlns="com-wordpress:feed-additions:1">114932494</site>	<item>
		<title>Practical Time Series Forecasting &#8211; Data Science Taxonomy</title>
		<link>https://www.kddanalytics.com/practical-time-series-forecasting-data-science-taxonomy/</link>
		
		<dc:creator><![CDATA[KDD]]></dc:creator>
		<pubDate>Tue, 02 Jan 2018 12:26:19 +0000</pubDate>
				<category><![CDATA[Data Analytics Methods]]></category>
		<category><![CDATA[Econometrics]]></category>
		<category><![CDATA[Forecasting]]></category>
		<category><![CDATA[Time Series]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[regression]]></category>
		<category><![CDATA[time series]]></category>
		<guid isPermaLink="false">http://www.kddanalytics.com/?p=1229</guid>

					<description><![CDATA[<p>“Big data is not about the data.*” ― Gary King, Harvard University (*It&#8217;s about the analytics.) Machine Learning. Deep Learning. Data Science. Artificial Intelligence. Big Data. Not a day goes by that one or all of these buzzwords stream past in our business news feeds. Data analytics has become mainstream. And you better jump on&#8230;</p>
<p>The post <a href="https://www.kddanalytics.com/practical-time-series-forecasting-data-science-taxonomy/">Practical Time Series Forecasting &#8211; Data Science Taxonomy</a> appeared first on <a href="https://www.kddanalytics.com">KDD Analytics</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>“Big data is not about the data.*”<br />
― <strong>Gary King, Harvard University</strong></p>
<p>(*<strong><a href="https://www.slideshare.net/BernardMarr/big-data-best-quotes/3-Big_data_is_notabout_the" target="_blank" rel="noopener">It&#8217;s about the analytics</a></strong>.)</p>
<p><strong>Machine Learning</strong>. <strong>Deep Learning</strong>. <strong>Data Science</strong>. <strong>Artificial Intelligence</strong>. <strong>Big Data</strong>.</p>
<p>Not a day goes by that one or all of these buzzwords stream past in our business news feeds.</p>
<p><strong>Data analytics has become mainstream</strong>. And you better jump on board or risk being left at the station!</p>
<p>Just within the last year or so, <strong>searches</strong> of these topics have taken off. In fact, according to Google, in early 2017, search interest in one of these topics, <strong>machine learning, has eclipsed that of big data</strong>:</p>
<p><img data-recalc-dims="1" decoding="async" loading="lazy" class="aligncenter wp-image-1230 size-large" src="https://i0.wp.com/www.kddanalytics.com/wp-content/uploads/2017/12/Google-Search-Machine-Learning-11_11_2012-to-11_11_2017.png?resize=1024%2C329&#038;ssl=1" alt="Google Search Machine Learning" width="1024" height="329" srcset="https://i0.wp.com/www.kddanalytics.com/wp-content/uploads/2017/12/Google-Search-Machine-Learning-11_11_2012-to-11_11_2017.png?resize=1024%2C329&amp;ssl=1 1024w, https://i0.wp.com/www.kddanalytics.com/wp-content/uploads/2017/12/Google-Search-Machine-Learning-11_11_2012-to-11_11_2017.png?resize=300%2C96&amp;ssl=1 300w, https://i0.wp.com/www.kddanalytics.com/wp-content/uploads/2017/12/Google-Search-Machine-Learning-11_11_2012-to-11_11_2017.png?resize=768%2C247&amp;ssl=1 768w, https://i0.wp.com/www.kddanalytics.com/wp-content/uploads/2017/12/Google-Search-Machine-Learning-11_11_2012-to-11_11_2017.png?w=1233&amp;ssl=1 1233w" sizes="auto, (max-width: 1000px) 100vw, 1000px" /></p>
<p>So, how do <strong>time series methods for forecasting</strong> fit into the taxonomy that currently defines the data science field?</p>
<h3>Data science taxonomy</h3>
<p>Key data science terms that are related to time series methods for forecasting are <strong><a href="https://www.datasciencecentral.com/profiles/blogs/data-mining-what-why-when">data mining</a></strong>, <a href="https://www.datasciencecentral.com/profiles/blogs/18-great-articles-about-predictive-analytics"><strong>predictive analytics</strong></a>, <a href="https://www.datasciencecentral.com/profiles/blogs/machine-learning-summarized-in-one-picture"><strong>machine learning</strong></a> (supervised and unsupervised), <a href="https://en.wikipedia.org/wiki/Linear_regression"><strong>regression</strong></a>, <strong>structured</strong> and <a href="https://en.wikipedia.org/wiki/Unstructured_data"><strong>unstructured</strong></a> data.</p>
<p>These are not necessarily mutually exclusive. At the risk of incurring the wrath of the data science gods, <strong>here is our simplification</strong>:</p>
<h4>Structured vs. unstructured data</h4>
<p>Structured data are organized into “rows and columns” (spreadsheet); unstructured data are not (text in a book).</p>
<p style="text-align: center;"><span style="color: #60786b;"><strong>Time series methods use structured data</strong>.</span></p>
<h4>Data mining</h4>
<p>Data mining seeks to find patterns in data, whether structured or unstructured.</p>
<p style="text-align: center;"><span style="color: #60786b;"><strong>Time series methods seek to find patterns that repeat over time</strong>.</span></p>
<h4>Predictive analytics</h4>
<p>Predictive analytics seeks to find a relationship between a variable of interest (e.g. customer churn) and multiple dimensions (e.g. age, length of contract, zip code). These dimensions can be used to predict the likelihood of a customer churning (in our example).</p>
<p>Typically, predictive analytics is not based on time series data but &#8220;cross-sectional&#8221; data like a customer set. Additionally, time series methods use only a very limited set of dimensions, the primary one being past behavior of the variable being forecasted (e.g. sales).</p>
<p style="text-align: center;"><span style="color: #60786b;"><strong>Time series methods typically use the past behavior of the variable being forecasted as the primary dimension.</strong></span></p>
<h4>Machine learning</h4>
<p>Machine learning means that a computer is using a program (algorithm) to “connect the dots” in the data. <strong>If you run a regression model in Excel you are engaging in machine learning.</strong></p>
<p>However, <span style="text-decoration: underline;">supervised</span> machine learning does not mean you are keeping watch over Excel as it does its stuff!</p>
<div id="attachment_1232" style="width: 310px" class="wp-caption alignright"><img data-recalc-dims="1" decoding="async" aria-describedby="caption-attachment-1232" loading="lazy" class="wp-image-1232 size-medium" src="https://i0.wp.com/www.kddanalytics.com/wp-content/uploads/2017/12/46961317_s.jpg?resize=300%2C200&#038;ssl=1" alt="supervised machine learning?" width="300" height="200" srcset="https://i0.wp.com/www.kddanalytics.com/wp-content/uploads/2017/12/46961317_s.jpg?resize=300%2C200&amp;ssl=1 300w, https://i0.wp.com/www.kddanalytics.com/wp-content/uploads/2017/12/46961317_s.jpg?w=450&amp;ssl=1 450w" sizes="auto, (max-width: 300px) 100vw, 300px" /><p id="caption-attachment-1232" class="wp-caption-text">This is NOT what &#8220;supervised&#8221; machine learning means!</p></div>
<p><strong>Supervised machine learning means</strong> that the computer is seeking to find a relationship between a single variable (e.g. churn) and many dimensional variables (e.g. age, length of contract, zip code).</p>
<p><strong>Unsupervised machine learning</strong> <strong>means</strong> that the computer is seeking to find a relationship between many dimensions (e.g. age, length of contract, zip code) so that customers can, for example, be clustered into a small number of groups or tribes with similar characteristics.</p>
<p style="text-align: center;"><span style="color: #60786b;"><strong>Time series methods are a type of supervised machine learning since they attempt to find a relationship between present and past behavior</strong>.</span></p>
<h4>Regression</h4>
<p>Regression is one way a machine finds relationships between a single variable and a few (or many) dimensional variables or past values of the variable itself. There are several flavors of regression.</p>
<p style="text-align: center;"><span style="color: #60786b;"><strong> Time series models typically use <a style="color: #60786b;" href="https://en.wikipedia.org/wiki/Least_squares">least squares</a> regression or <a style="color: #60786b;" href="https://en.wikipedia.org/wiki/Maximum_likelihood_estimation">maximum likelihood</a></strong>.</span></p>
<h3>Bottom line</h3>
<p>So, when you use time series methods for forecasting you are probably <strong>mining structured data using supervised, regression- or maximum likelihood-based, machine learning</strong>.</p>
<a class="dpsp-click-to-tweet dpsp-style-1" href="https://twitter.com/intent/tweet?text=%E2%80%9CBig+data+is+not+about+the+data.%E2%80%9D&url=https%3A%2F%2Fwww.kddanalytics.com%2Fpractical-time-series-forecasting-data-science-taxonomy%2F"><div class="dpsp-click-to-tweet-content">“Big data is not about the data.”</div><div class="dpsp-click-to-tweet-footer"><span class="dpsp-click-to-tweet-cta"><span>Click to Tweet</span><i class="dpsp-network-btn dpsp-twitter"><span class="dpsp-network-icon"></span></i></span></div></a>
<p><a href="https://www.kddanalytics.com/practical-time-series-forecasting-introduction/" target="_blank" rel="noopener"><strong>Part 1 &#8211; Practical Time Series Forecasting &#8211; Introduction</strong></a></p>
<p><a href="https://www.kddanalytics.com/practical-time-series-forecasting-basics/" target="_blank" rel="noopener"><strong>Part 2 &#8211; Practical Time Series Forecasting &#8211; Some Basics</strong></a></p>
<p><a href="https://www.kddanalytics.com/practical-time-series-forecasting-useful-models/" target="_blank" rel="noopener"><strong>Part 3 &#8211; Practical Time Series Forecasting &#8211; Potentially Useful Models</strong></a></p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>The post <a href="https://www.kddanalytics.com/practical-time-series-forecasting-data-science-taxonomy/">Practical Time Series Forecasting &#8211; Data Science Taxonomy</a> appeared first on <a href="https://www.kddanalytics.com">KDD Analytics</a>.</p>
]]></content:encoded>
					
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">1229</post-id>	</item>
	</channel>
</rss>
