tag:blogger.com,1999:blog-71352672657981153562024-03-18T00:14:54.369-07:00Pentaho Data IntegrationPentaho Kettle Data Integration Installation for Windows, Pentaho Data Integration ( PDI ) - Generic Design GuidelinesAravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.comBlogger27125tag:blogger.com,1999:blog-7135267265798115356.post-17733883926490382472022-11-25T00:03:00.005-08:002022-11-25T00:05:00.216-08:00Pentaho Data Integration - Get file names step<p> The Get Filenames step allows you to retrieve information associated with filenames in the file system. The obtained file name is added to the stream as a line. Search for files using wildcard (RegExp) fields</p><p> Stepwise illustration on how to use "Get file names" step given below.</p><p></p><div class="separator" style="clear: both; text-align: left;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEg-dm8u9yYIm4RM5697imM5awPG4ShN5mvxnwaWK_Ip3YMrRCW__CGIzOoZ6yGuNSgSaxXbv5AKioVbTtfp-z-Kd7CUB9T5y-QZnZeUJKhBU74KbeZmiWCVFWpJfwfbb0oHBdLAEvKSM9_T7EmdzzY_ObLWKYqm9pZ-_kzV0XvIDZ_463w7uydShijz" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="172" data-original-width="220" height="110" src="https://blogger.googleusercontent.com/img/a/AVvXsEg-dm8u9yYIm4RM5697imM5awPG4ShN5mvxnwaWK_Ip3YMrRCW__CGIzOoZ6yGuNSgSaxXbv5AKioVbTtfp-z-Kd7CUB9T5y-QZnZeUJKhBU74KbeZmiWCVFWpJfwfbb0oHBdLAEvKSM9_T7EmdzzY_ObLWKYqm9pZ-_kzV0XvIDZ_463w7uydShijz=w141-h110" width="141" /></a></div><span><a name='more'></a></span><div class="separator" style="clear: both; text-align: left;"><br /></div><div class="separator" style="clear: both; text-align: left;"><b><u><span style="color: #0b5394;">Configure File tab</span></u></b></div><div class="separator" style="clear: both; text-align: left;"><b><u><span style="color: #0b5394;"><br /></span></u></b></div><div class="separator" style="clear: both; text-align: left;">Enter directory name and click add.</div><div class="separator" style="clear: both; text-align: left;"><br /></div><div class="separator" style="clear: both; text-align: left;"><div class="separator" style="clear: both; text-align: center;"><div class="separator" style="clear: both; text-align: left;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEhZmjIshmIvVLgUjcOlT66EkETO79JxAdihnovsq4QgXjvegEMqwmpRYIss-mWro4peCn5uxiOL59l9Ph7VpCkCuCNgE8YFcoboN9oOPGzeHIn6wlwU_RhIjGWcxm8M9hDqu1ttHFP3jCgVo-O3w_F2xiPdJVIgZJOomzB7-NGHFMNvrhKdBonJt2S7" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="190" data-original-width="1574" height="49" src="https://blogger.googleusercontent.com/img/a/AVvXsEhZmjIshmIvVLgUjcOlT66EkETO79JxAdihnovsq4QgXjvegEMqwmpRYIss-mWro4peCn5uxiOL59l9Ph7VpCkCuCNgE8YFcoboN9oOPGzeHIn6wlwU_RhIjGWcxm8M9hDqu1ttHFP3jCgVo-O3w_F2xiPdJVIgZJOomzB7-NGHFMNvrhKdBonJt2S7=w400-h49" width="400" /></a></div><br /><br /></div>Enter ".*.csv" as RegExp Wildcard to get only CSV files.</div><div class="separator" style="clear: both; text-align: left;"><br /></div><div class="separator" style="clear: both; text-align: left;"><div class="separator" style="clear: both; text-align: left;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEgSZ_7FcgBE_XEm9d4ZtKddDlQO8QiITABvmBS6mFKUX3XcLqOC6BF_4EtNqSphkttmE1EUgQLW4sv_OXx9FmrPMEvX3P1zJLtdSQQA_6Te9qh6U5fLcA-DJZPlG0QRdxVp0s7Ye53SNaRGqszA8fzH_0SJhcv29d5VwMfuJBfA_A0cLoPT9jc6R0tE" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="998" data-original-width="2246" height="178" src="https://blogger.googleusercontent.com/img/a/AVvXsEgSZ_7FcgBE_XEm9d4ZtKddDlQO8QiITABvmBS6mFKUX3XcLqOC6BF_4EtNqSphkttmE1EUgQLW4sv_OXx9FmrPMEvX3P1zJLtdSQQA_6Te9qh6U5fLcA-DJZPlG0QRdxVp0s7Ye53SNaRGqszA8fzH_0SJhcv29d5VwMfuJBfA_A0cLoPT9jc6R0tE=w400-h178" width="400" /></a></div><div class="separator" style="clear: both; text-align: left;"><br /></div><br /></div><p></p><div class="separator" style="clear: both;"><b><u><span style="color: #0b5394;">Configure Filter tab</span></u></b></div><div class="separator" style="clear: both;"><div class="separator" style="clear: both;"><b><u><span style="color: #0b5394;"><br /></span></u></b></div><div class="separator" style="clear: both;">Select "All Files" to get both files and folders.</div><div class="separator" style="clear: both;"><br /></div><div class="separator" style="clear: both;"><div class="separator" style="clear: both; text-align: center;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEierTHAo5SSdxe6fJtMIvAWvi8aZEpDqjxod_Si-K9PQKvUPNjHuaHd8rKRizJqrB--QvLF62_RwJHgW-KpIWNn1W15Re5c0lWmg0IcQXuANlGeCseE8Sw1zE1elLKR9b0VLPwF1M9ZxPP1-h0LdC-vXqz9CHI4ezU0SBJ2oov4hwXZJuTFYj2REtWV" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img alt="" data-original-height="1002" data-original-width="2250" height="179" src="https://blogger.googleusercontent.com/img/a/AVvXsEierTHAo5SSdxe6fJtMIvAWvi8aZEpDqjxod_Si-K9PQKvUPNjHuaHd8rKRizJqrB--QvLF62_RwJHgW-KpIWNn1W15Re5c0lWmg0IcQXuANlGeCseE8Sw1zE1elLKR9b0VLPwF1M9ZxPP1-h0LdC-vXqz9CHI4ezU0SBJ2oov4hwXZJuTFYj2REtWV=w400-h179" width="400" /></a></div><br /><br /></div><div class="separator" style="clear: both;"><br /></div><div class="separator" style="clear: both;"><br /></div><div class="separator" style="clear: both;"><br /></div><div class="separator" style="clear: both;"><span style="color: #0b5394;"><b><u>Preview Rows</u></b></span></div><div class="separator" style="clear: both;"><span style="color: #0b5394;"><b><u><br /></u></b></span></div><div class="separator" style="clear: both;"><span style="color: #0b5394;"><b><u><div class="separator" style="clear: both; text-align: left;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEglcYBFMS4uuORfjatDKiAuKg3JNUBA941caW43a9Jg__dty3y3DP1fG1MeRGJKjjnc_YGCqQBcv6xaXRUdp7Qlgcm7Zk8-zI8wWNaNO6czrNZQ8Sbr2ocjp61OSkHrJNlSx3pe5Ok3K5eghGC2erdQMuYKHpKOqT-EMog-Ee88-_S5Gagc46KyC19d" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="636" data-original-width="3160" height="128" src="https://blogger.googleusercontent.com/img/a/AVvXsEglcYBFMS4uuORfjatDKiAuKg3JNUBA941caW43a9Jg__dty3y3DP1fG1MeRGJKjjnc_YGCqQBcv6xaXRUdp7Qlgcm7Zk8-zI8wWNaNO6czrNZQ8Sbr2ocjp61OSkHrJNlSx3pe5Ok3K5eghGC2erdQMuYKHpKOqT-EMog-Ee88-_S5Gagc46KyC19d=w640-h128" width="640" /></a></div><br /><br /></u></b></span></div></div><p><b style="color: #0b5394;"><u>Execution and Results</u></b></p><p><a href="https://blogger.googleusercontent.com/img/a/AVvXsEibbpf240NEfy_ZXcNehoj6HdM6hXafe8Zdn1daTo5jTllDjcMqbv7X0I0dzum-F1J5CeXI1FSTJYu4ReBgemnmyJY1jOoqYBKR17AKCgQWc2gqQ_7_qAA9msX3s1bVAnTMC98CQh3_ibFabdJ28zR1WSU6rYeKdAfk1fQ_CsCToHwvyi3LXVE3HqBp" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="348" data-original-width="1044" height="134" src="https://blogger.googleusercontent.com/img/a/AVvXsEibbpf240NEfy_ZXcNehoj6HdM6hXafe8Zdn1daTo5jTllDjcMqbv7X0I0dzum-F1J5CeXI1FSTJYu4ReBgemnmyJY1jOoqYBKR17AKCgQWc2gqQ_7_qAA9msX3s1bVAnTMC98CQh3_ibFabdJ28zR1WSU6rYeKdAfk1fQ_CsCToHwvyi3LXVE3HqBp=w400-h134" width="400" /></a></p><p>Complete the flow by adding a dummy step and execute transformation.</p><div style="text-align: left;"><br /></div><div><div><span style="color: #2b00fe; font-family: arial; font-size: x-small;"><i>2022/11/24 23:57:41 - Spoon - Running transformation using the Kettle execution engine</i></span></div><div><span style="color: #2b00fe; font-family: arial; font-size: x-small;"><i>2022/11/24 23:57:41 - Spoon - Transformation opened.</i></span></div><div><span style="color: #2b00fe; font-family: arial; font-size: x-small;"><i>2022/11/24 23:57:41 - Spoon - Launching transformation [Untitled]...</i></span></div><div><span style="color: #2b00fe; font-family: arial; font-size: x-small;"><i>2022/11/24 23:57:41 - Spoon - Started the transformation execution.</i></span></div><div><span style="color: #2b00fe; font-family: arial; font-size: x-small;"><i>2022/11/24 23:57:41 - Untitled - Dispatching started for transformation [Untitled]</i></span></div><div><span style="color: #2b00fe; font-family: arial; font-size: x-small;"><i>2022/11/24 23:57:41 - Get file names.0 - Finished processing (I=0, O=0, R=0, W=13, U=0, E=0)</i></span></div><div><span style="color: #2b00fe; font-family: arial; font-size: x-small;"><i>2022/11/24 23:57:41 - Dummy (do nothing).0 - Finished processing (I=0, O=0, R=13, W=13, U=0, E=0)</i></span></div><div><span style="color: #2b00fe; font-family: arial; font-size: x-small;"><i>2022/11/24 23:57:41 - Spoon - The transformation has finished!!</i></span></div></div><div><br /></div><div><br /></div><div><br /></div><p><b>Results :</b></p><p></p><div class="separator" style="clear: both; text-align: left;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEhxQ1emv3YUNRV77-2J07WF6G20ue4tRgi3u5EG1w1pWyHif1DQsgTzwhsKiEyYR-675EWuHU6YiPQ7h2zDvAd9ukI-L9rKFxUmXRAcQd1HIXKKXUMhcz7NVtnKmzS2OP5Uwj8t0W9SYh-ZEHlyQegVajR0geOW-Zah3OxrNooFrfb5OqjE8CWNZlFK" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="908" data-original-width="2486" height="234" src="https://blogger.googleusercontent.com/img/a/AVvXsEhxQ1emv3YUNRV77-2J07WF6G20ue4tRgi3u5EG1w1pWyHif1DQsgTzwhsKiEyYR-675EWuHU6YiPQ7h2zDvAd9ukI-L9rKFxUmXRAcQd1HIXKKXUMhcz7NVtnKmzS2OP5Uwj8t0W9SYh-ZEHlyQegVajR0geOW-Zah3OxrNooFrfb5OqjE8CWNZlFK=w640-h234" width="640" /></a></div><br /><br /><p></p><p><br /></p><p><span style="color: #0b5394;"><b><u><br /></u></b></span><br /></p><p><br /></p><p><br /><br /><br /><br /><br /></p><p></p><div class="separator" style="clear: both; text-align: left;"><br /></div><br /><br /><p></p>Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com0tag:blogger.com,1999:blog-7135267265798115356.post-31391674234306903842022-11-24T01:45:00.002-08:002022-11-24T01:47:21.546-08:00Pentaho Data Integration - Community Edition Install for Mac<p> Pentaho is an end-to-end data integration and analytics platform designed to manage data at scale for rapid business innovation, ease of use, and self-service automation and orchestration. Pentaho tightly ties data integration and business analytics in a modern platform that connects IT and business users to access, visualize, and explore all the data that impacts business outcomes. Pentaho Kettle enables IT and developers to integrate data from different sources and deliver it to business applications. </p><div><div><div class="separator" style="clear: both;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEhYYqOkHzR5UHaO3QE1xa7QwUInaITGVzZbGt0uWxsmTDOMgwu-OYXC-aIjakMY9Vn93esySWPPl_p9aixxU20GK73mRZdOhsFcoivX0g_CZnNOrGAu26DS6-1ExXICxt1u2y-BlLTKKYt_zAlAk1Ms_sT5gXYboPRrScvyeKsAZGb5PZ7Tkve4fzGu" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="304" data-original-width="1266" height="34" src="https://blogger.googleusercontent.com/img/a/AVvXsEhYYqOkHzR5UHaO3QE1xa7QwUInaITGVzZbGt0uWxsmTDOMgwu-OYXC-aIjakMY9Vn93esySWPPl_p9aixxU20GK73mRZdOhsFcoivX0g_CZnNOrGAu26DS6-1ExXICxt1u2y-BlLTKKYt_zAlAk1Ms_sT5gXYboPRrScvyeKsAZGb5PZ7Tkve4fzGu=w143-h34" width="143" /></a></div><div><br /></div><div>Step wise illustration on how to install Pentaho Data Integration community edition 9.3.0.0 is given below.</div><span><a name='more'></a></span><div><br /></div><b><u><span style="color: #3d85c6;">Step 1</span></u></b></div><div><b><u><span style="color: #3d85c6;"><br /></span></u></b></div><div>Download Pentaho community edition.</div><div><br /></div><div><a href="https://www.hitachivantara.com/en-us/products/lumada-dataops/data-integration-analytics/pentaho-community-edition.html" target="_blank">Pentaho community edition</a></div><div><br /></div><div><div class="separator" style="clear: both;">Accept license agreements and download "pdi-ce-9.3.0.0-428.zip"</div><div class="separator" style="clear: both;"><br /></div><div class="separator" style="clear: both;"><br class="Apple-interchange-newline" /><span style="color: #3d85c6; font-weight: 700; text-decoration-line: underline;">Step 2</span></div><br />Extract the contents.</div><div><br /></div><div><div class="separator" style="clear: both;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEhGYLSaW89iwvy8nZAPppmYdU8H9WFWR3foKlzV5fRKhD8kEkQpl78fXc54dcDMZL003Mq8DOB_ldR7upNUW60dMg78GzB0X7q3UucEnmBwFk1GStOn4wkdsB7dBejyqjdbg_gvFqibjdl2mxLBeLxGPrpsmLK5yLOUUgID1Kgz6J7yT4mURILNs20V" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="578" data-original-width="658" height="240" src="https://blogger.googleusercontent.com/img/a/AVvXsEhGYLSaW89iwvy8nZAPppmYdU8H9WFWR3foKlzV5fRKhD8kEkQpl78fXc54dcDMZL003Mq8DOB_ldR7upNUW60dMg78GzB0X7q3UucEnmBwFk1GStOn4wkdsB7dBejyqjdbg_gvFqibjdl2mxLBeLxGPrpsmLK5yLOUUgID1Kgz6J7yT4mURILNs20V" width="273" /></a></div><div><br /></div>Right click on "Data Integration" and show package contents.</div><div><br /></div><div><div class="separator" style="clear: both;"><br class="Apple-interchange-newline" /><span style="color: #3d85c6; font-weight: 700; text-decoration-line: underline;">Step 3</span></div><br />Execute JavaApplicationStub.</div><div><br /></div><div><div class="separator" style="clear: both;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEgel24_-YNhJuCDmCsfKDlldi9sbTW8d4dWV116uEwAOMNIsIKFQZjBQptW5kaMITKFj4h2ca6ryHau0ZpZcX2aCEF991pi638NGHeVp9KJ_eT4r30bg7HMJE2M9iioiHLdWHfdZUKGaBNlMzevKQC-M0lvZK2-dS2lq4o-WubohF0XU4oaRWlCa6Nv" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="506" data-original-width="654" height="240" src="https://blogger.googleusercontent.com/img/a/AVvXsEgel24_-YNhJuCDmCsfKDlldi9sbTW8d4dWV116uEwAOMNIsIKFQZjBQptW5kaMITKFj4h2ca6ryHau0ZpZcX2aCEF991pi638NGHeVp9KJ_eT4r30bg7HMJE2M9iioiHLdWHfdZUKGaBNlMzevKQC-M0lvZK2-dS2lq4o-WubohF0XU4oaRWlCa6Nv" width="310" /></a></div><br /><div class="separator" style="clear: both;"><span style="color: #3d85c6; font-weight: 700; text-decoration-line: underline;">Step 4</span></div><br />Pentaho Data Integration community edition</div><div><br /></div><div><div class="separator" style="clear: both;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEiZ-4AvYjfCknRaNKNbyxiTI8LnRCYkI9ARA4hin7bPwc2PJQSqKrJPbfDh3bimyoD8EsUXK4KJkII2l9jx3obPxyaBt2ZSlvcebiDj45L4DpVpaKeUQOrhpZJzYrgpcEee6HKrvxeZpUDtis9sPQoRIO01nA9dc1EBEKg2erbserSdwvFKs6MzpZg1" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="1000" data-original-width="1400" height="229" src="https://blogger.googleusercontent.com/img/a/AVvXsEiZ-4AvYjfCknRaNKNbyxiTI8LnRCYkI9ARA4hin7bPwc2PJQSqKrJPbfDh3bimyoD8EsUXK4KJkII2l9jx3obPxyaBt2ZSlvcebiDj45L4DpVpaKeUQOrhpZJzYrgpcEee6HKrvxeZpUDtis9sPQoRIO01nA9dc1EBEKg2erbserSdwvFKs6MzpZg1" width="320" /></a></div><br /><br /><br /></div><div><div class="separator" style="clear: both;"><span style="color: #3d85c6; font-weight: 700; text-decoration-line: underline;"><br class="Apple-interchange-newline" />Step 5</span></div><br />Spoon application will open up and ready to use.</div><div><br /></div><div><div class="separator" style="clear: both;"><a href="https://blogger.googleusercontent.com/img/a/AVvXsEgQlTwvbMR_IFSPGTjvFd2rQyiC-kXUjbglw_Th38ZKX2vzcOB20XZfXq1u0fVbuV483KdWQpBqcbhoSwIF_ro6oiPU_2LK3iKpA9C-S8BOv6rWYAlAq8G3RD60MpAJ52AyBz7ejYDuUFk3agSdOMhhPvnrPRtX9ZqYxDLK-bL3zLp5IKuLwToZ2L5s" style="margin-left: 1em; margin-right: 1em;"><img alt="" data-original-height="1476" data-original-width="2778" height="170" src="https://blogger.googleusercontent.com/img/a/AVvXsEgQlTwvbMR_IFSPGTjvFd2rQyiC-kXUjbglw_Th38ZKX2vzcOB20XZfXq1u0fVbuV483KdWQpBqcbhoSwIF_ro6oiPU_2LK3iKpA9C-S8BOv6rWYAlAq8G3RD60MpAJ52AyBz7ejYDuUFk3agSdOMhhPvnrPRtX9ZqYxDLK-bL3zLp5IKuLwToZ2L5s" width="320" /></a></div><br /><br /></div><div><br /></div></div>Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com0tag:blogger.com,1999:blog-7135267265798115356.post-75327105632872426922017-04-18T23:56:00.002-07:002018-06-08T20:54:53.498-07:00Pentaho Data Integration - PDI 7.0 Installation for Windows 64 bit <h3 class="post-title entry-title">
</h3>
<div class="post-header">
<div class="post-header-line-1">
</div>
</div>
Pentaho 7 is the latest Pentaho version with powerful features including enhanced big data security features and advanced data exploration functionality.<br />
<br />
Step wise illustration on how to install Pentaho Data Integration 7 is given below.<br />
<br />
Here are some of the highlights of the new version. <br />
<br />
<ul>
<li><span style="color: blue;">Inspect Data in the Pipeline.</span></li>
<li><span style="color: blue;">Advanced Security features for Bigdata including Kerberos.</span></li>
<li><span style="color: blue;">Integrated installation of Business Analytics (BA) and Data Integration (DI) components.</span></li>
<li><span style="color: blue;">Spark submit job entry for scala and python.</span></li>
<li><span style="color: blue;">Expanded Metadata Injection Support. </span></li>
</ul>
<b></b><br />
<a name='more'></a><div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi7s2l62pfdVDpYd4Y52ZPqs-UrWOU_Ypf3o1D_CCofoBUTPaQ031viltHNFw1hz_7KOrRqSoRuERg23oQ27ihLKpI-Gw1wexQzi5RRLr4lvOACVfkybr69k83K59S8_sEI6DqVkursvvk/s1600/pentaho.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" data-original-height="59" data-original-width="58" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi7s2l62pfdVDpYd4Y52ZPqs-UrWOU_Ypf3o1D_CCofoBUTPaQ031viltHNFw1hz_7KOrRqSoRuERg23oQ27ihLKpI-Gw1wexQzi5RRLr4lvOACVfkybr69k83K59S8_sEI6DqVkursvvk/s1600/pentaho.jpg" /></a></div>
<br />
<br />
<br />
<br />
<br />
<b><u>Step 1</u></b><br />
<b></b><br />
Download PDI 7.0 installation software "pentaho-business-analytics-7.0.0-25-x64.exe".<br />
<br />
<a href="http://www.pentaho.com/download">http://www.pentaho.com/download</a><br />
<br />
Execute the installation file.<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjNmqAvLrTPji4BG0GsUDPb9mBOT9r_2gGzaYgR116qM0VASgqTn4ab6UQC8rjIxI_qTZ6LWJgwLPl1fnr0gHVtA1dGKMJLmdYFgtJqeHbHMSX6AAfNuHLv0gIeKa28_Ddktazqbp2L7l0/s1600/pent_1.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="33" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjNmqAvLrTPji4BG0GsUDPb9mBOT9r_2gGzaYgR116qM0VASgqTn4ab6UQC8rjIxI_qTZ6LWJgwLPl1fnr0gHVtA1dGKMJLmdYFgtJqeHbHMSX6AAfNuHLv0gIeKa28_Ddktazqbp2L7l0/s320/pent_1.png" width="320" /></a><br />
<br />
<br />
<br />
<b><u>Step 2</u></b><br />
<b></b><br />
Pentaho installation wizard will popup and will guide through the installation.<br />
Click Next.<br />
<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjVvPS9ovCXMv_B88vN33sDw_cjUAzw85fSwHg6xZXZ4oXN5Y9IGh3c6dwGBRVJWhS0zJAuEcQRK4MgTB6hTjahSGBP7Z5GCRENWFUH9LwGQhR_P02XgBJifcL_VZ3XX6ZpFfx9fFqCTCw/s1600/pent2.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="280" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjVvPS9ovCXMv_B88vN33sDw_cjUAzw85fSwHg6xZXZ4oXN5Y9IGh3c6dwGBRVJWhS0zJAuEcQRK4MgTB6hTjahSGBP7Z5GCRENWFUH9LwGQhR_P02XgBJifcL_VZ3XX6ZpFfx9fFqCTCw/s400/pent2.png" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b><u>Step 3</u></b><br />
<b></b><br />
Accept license agreements.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjBOaP4t9xuONO9JTt3CTFyA5toZnJY4WG7YJBsMBbpzaAV_m3f5_4bQjlUtL0iUSK9v7wkuaTiFo08S0ozoHLr2xrW6Vt9Fg4UKWvP7InRP-Ykja3ABFJ4-X2aq7J4BBO_jgIbu4saXMM/s1600/pent3.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="281" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjBOaP4t9xuONO9JTt3CTFyA5toZnJY4WG7YJBsMBbpzaAV_m3f5_4bQjlUtL0iUSK9v7wkuaTiFo08S0ozoHLr2xrW6Vt9Fg4UKWvP7InRP-Ykja3ABFJ4-X2aq7J4BBO_jgIbu4saXMM/s400/pent3.png" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b><u>Step 4</u></b><br />
<b></b><br />
Specify the location to install the Pentaho applications.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjoJYCTA3jQFiuGKmGVQ3sm2lio2igGY3Rco1JGA6-eZ7cDgP0LkamVm1vjYQbeNQ4n_Y65rgslxXIw1Ir4IrGYL1qFUYzgJocZ2UwT3UjFuKnkACXfmOGurHJABdNJu2Gm0ocoq8n0oY8/s1600/pent5.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="281" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjoJYCTA3jQFiuGKmGVQ3sm2lio2igGY3Rco1JGA6-eZ7cDgP0LkamVm1vjYQbeNQ4n_Y65rgslxXIw1Ir4IrGYL1qFUYzgJocZ2UwT3UjFuKnkACXfmOGurHJABdNJu2Gm0ocoq8n0oY8/s400/pent5.png" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b><u>Step 5</u></b><br />
<b></b><br />
Postgres will be installed along with PDI to store reports and other user information. Please specify password for postgres.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjXEzok_jbVCAY7WyLxstcUlCvOQYD1HeZVUK2eK5m2-C0EyOT9EQGd67yv1OkAMBx6zEdwr2esCyNTG3rCUVyUjrlFoBEOw3OV14qMifIafnlJDusn995JIknnaMkOc-6HfbB9yHZ5SHo/s1600/pent6.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="278" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjXEzok_jbVCAY7WyLxstcUlCvOQYD1HeZVUK2eK5m2-C0EyOT9EQGd67yv1OkAMBx6zEdwr2esCyNTG3rCUVyUjrlFoBEOw3OV14qMifIafnlJDusn995JIknnaMkOc-6HfbB9yHZ5SHo/s400/pent6.png" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b><u>Step 6</u></b><br />
<b></b><br />
Setup ready to install. Click next to proceed.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhC5sakFRqhtFB5NeksV0Rf8zlduMJWV-H5VQueVhgd3kDKsOTj-iT8HWLjc2faoEFsfEQjaQAEwnCDFmOmExDlw72xoucXnC-Bjxs-5akwAyuwgbgN85MdYblS6Ogt41my2Bno9duYrLo/s1600/pent7.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="280" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhC5sakFRqhtFB5NeksV0Rf8zlduMJWV-H5VQueVhgd3kDKsOTj-iT8HWLjc2faoEFsfEQjaQAEwnCDFmOmExDlw72xoucXnC-Bjxs-5akwAyuwgbgN85MdYblS6Ogt41my2Bno9duYrLo/s400/pent7.png" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b><u>Step 7</u></b><br />
<b></b><br />
Installation in progress.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiTjib6bUnqco1O1xYl9py0kXi-_1rGE3AfNrZJu6_I-rEkF8uqKv2TBM8hND_ZQH5znfisg8YCgTX61VV6JDqvCBJThkqcrqqPpel_sfr1jvmlRwIl1N350kTsf1YO00qxlo1Ve8zlyls/s1600/pent8.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="280" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiTjib6bUnqco1O1xYl9py0kXi-_1rGE3AfNrZJu6_I-rEkF8uqKv2TBM8hND_ZQH5znfisg8YCgTX61VV6JDqvCBJThkqcrqqPpel_sfr1jvmlRwIl1N350kTsf1YO00qxlo1Ve8zlyls/s400/pent8.png" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b><u>Step 8</u></b><br />
<b></b><br />
Starting Pentaho Server.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhojO5pTZBDJ5XCyxRlVnUd1b9BbjB5wvRr9RXlGS66LOyurnenEa9Vug_0_-DLDAGtrlQi8VTe5NHcQpR4EGimQR3xlmTxpPa2I3Sw_644rEe5hyphenhyphenznd4ozhVhWMmCUnqWPm0gqXOiu3vM/s1600/pent9.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="282" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhojO5pTZBDJ5XCyxRlVnUd1b9BbjB5wvRr9RXlGS66LOyurnenEa9Vug_0_-DLDAGtrlQi8VTe5NHcQpR4EGimQR3xlmTxpPa2I3Sw_644rEe5hyphenhyphenznd4ozhVhWMmCUnqWPm0gqXOiu3vM/s400/pent9.png" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b><u>Step 9</u></b><br />
<b></b><br />
Installation completed successfully.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhSP9V2SrduQQstjTPr-ZFCCzBWk4Hj49YVD0-M9m35vHBhzDshW05QpGdtHj7IaAPw78e09weMG182nfpVewJRBcWxw3GyUx0pw3Ah-S-iJEpeNECyYlcHXzCeiTYG5bZe_6bsJpPFdRg/s1600/pent10.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="280" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhSP9V2SrduQQstjTPr-ZFCCzBWk4Hj49YVD0-M9m35vHBhzDshW05QpGdtHj7IaAPw78e09weMG182nfpVewJRBcWxw3GyUx0pw3Ah-S-iJEpeNECyYlcHXzCeiTYG5bZe_6bsJpPFdRg/s400/pent10.png" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b></b>
<u><b>Installation Summary</b></u><br />
<br />
<span style="color: blue;"><span style="color: orange;"> <i><b>Selections/Settings:</b></i></span><br /><br /> - PostgreSQL server port: 5432<br /> - HSQL server port: 9001<br /> - Pentaho Server (Tomcat) startup port: 8080<br /> - Pentaho Server shutdown port: 8012<br /><br /><span style="color: orange;"><i><b> - Reporting Design Tools:</b></i></span><br /> - Report Designer<br /> - Metadata Editor<br /><br /><span style="color: orange;"><i><b> - Analysis Design Tools:</b></i></span><br /> - Schema Workbench<br /> - Aggregate Table Designer<br /><br /> - Dashboard Designer<br /> - Data Integration (ETL)<br /> - Analyzer<br /> - Interactive Reporting<br /> - Mobile<br /> - Sample Content</span><br />
<br />
<b><u>Step 10</u></b><br />
<b></b><br />
Log into user console.<br />
<br />
<a href="http://www.pentaho.com/download">http://localhost:8080/pentaho/Home</a><br />
<br />
<ul>
<li>Default user : admin</li>
<li>Default Password : password</li>
</ul>
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjc83qpYzuU22TJCec-ZtrySo0ShZlTFToN8ql9iwHNE7ikDJcZojqYazjOeOX3QGt_Bk2s1exWaEGJy-eA7y1GI817Z9bHFy2FkWTzAAIcbu0gmACs1XmOl_jKmWuF6szYAorZLtO2Yow/s1600/pent12.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="192" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjc83qpYzuU22TJCec-ZtrySo0ShZlTFToN8ql9iwHNE7ikDJcZojqYazjOeOX3QGt_Bk2s1exWaEGJy-eA7y1GI817Z9bHFy2FkWTzAAIcbu0gmACs1XmOl_jKmWuF6szYAorZLtO2Yow/s400/pent12.png" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b><u>Step 11</u></b><br />
<b></b><br />
Open Spoon.<br />
Click on Design Tools >> Data Integration.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiclPIxMOxBXvNWFmw8WaknKMu7SMKBExVecgSczRFD_hCyqsk9tjvv5cBBSls0oyl1UYIwjLrVg0amrctQugUacJHJtEGKxJYZbuy3Z5iQ8eBnjVXoUAUvzkwRi4wn0BM_LfXZpT5kL4A/s1600/pent15.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiclPIxMOxBXvNWFmw8WaknKMu7SMKBExVecgSczRFD_hCyqsk9tjvv5cBBSls0oyl1UYIwjLrVg0amrctQugUacJHJtEGKxJYZbuy3Z5iQ8eBnjVXoUAUvzkwRi4wn0BM_LfXZpT5kL4A/s1600/pent15.png" /></a></div>
Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com20tag:blogger.com,1999:blog-7135267265798115356.post-10725467334712425162015-10-02T20:33:00.000-07:002015-10-02T20:37:53.982-07:00Pentaho Data Integration : Aggregation using Group By stepThis step can be used to perform various types of aggregations such as sum, average, min, max e.t.c. Input data always need to be sorted for this step to work properly.<br />
<br />
This step support following aggregation methods.<br />
<ol>
<li><span style="color: blue;">Sum</span></li>
<li><span style="color: blue;">Average or Mean<a name='more'></a></span></li>
<li><span style="color: blue;">Median</span></li>
<li><span style="color: blue;">Minimum</span></li>
<li><span style="color: blue;">Maximum</span></li>
<li><span style="color: blue;">Standard deviation</span></li>
<li><span style="color: blue;">Percentile</span></li>
<li><span style="color: blue;">Number of values</span></li>
<li><span style="color: blue;">Number of distinct values </span></li>
<li><span style="color: blue;">Number of rows</span></li>
<li><span style="color: blue;">First non-null value</span></li>
<li><span style="color: blue;">Last non-null value</span></li>
<li><span style="color: blue;">First value null inclusive</span></li>
<li><span style="color: blue;">Last value null inclusive</span></li>
<li><span style="color: blue;">Cumulative sum for all rows</span></li>
<li><span style="color: blue;">Cumulative average for all rows</span></li>
<li><span style="color: blue;">Concatenate strings separated by comma</span></li>
<li><span style="color: blue;">Concatenate strings separated by a value</span>. </li>
</ol>
<br />
Here are the commonly used options for this step.<br />
<br />
<ul>
<li><i><b>Group fields - </b>The fields for Group by.</i><i><b> </b></i></li>
<li><i><b>Aggregates fields -</b> Specify aggregate field names and aggregate methods. </i><i><b> </b></i></li>
<li><i><b>Include all rows -</b> Output will contain all rows instead of just the aggregate rows</i><i><b> </b></i></li>
<li><i><b>Add a line number, Restart in each group - </b> Option to be used along with "Include all rows"</i><i><b> </b></i></li>
<li><i><b>Always give back a result row -</b> To make sure there is an output row in case of no input rows ( for count of 0 ). </i></li>
</ul>
<br />
Step wise illustration on how to use "Group By" step given below.<br />
<br />
<i> </i><br />
<h3>
<span style="color: #cc0000;"><u><b>Step 1 </b></u></span></h3>
Create a demo transformation.<br />
<br />
This demo transformation will read from a CSV input file which contain product information. "Group By" step will be used to aggregate product price based on product category. The input records need to be sorted to use Group BY. A sort step is used for this.<br />
<br />
Here are the configurations for the CSV input and Sort step.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh_17fZpcByfWgvWkZ6BiVpsuWi2PJzLUSLTMiPLPIqHKs0R5XfHeTCwsr2i069VU_1SRJBm3Pf5Wa10rXONdwG4ByAHgkwm4G5lZ0vEs068WvABAcrVeg6CLf6w-yGl1g2M7diZfUrab8/s1600/step1.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="330" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh_17fZpcByfWgvWkZ6BiVpsuWi2PJzLUSLTMiPLPIqHKs0R5XfHeTCwsr2i069VU_1SRJBm3Pf5Wa10rXONdwG4ByAHgkwm4G5lZ0vEs068WvABAcrVeg6CLf6w-yGl1g2M7diZfUrab8/s640/step1.png" width="640" /></a></div>
<h3>
<span style="color: #cc0000;"><u><b>Step 2</b></u></span></h3>
Drag the "Group By" step into the canvas.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgGWrSN3Bcj3pJK4psmnA4CJ0lYKrwbcum3HeemnU5AAKvo1Zge75TfKl88KF9t-HprHqymhRtOelFMrI8QgUAiGTBxCqj5o0eULsm-3H59M5sZ9FFVKqdzziDb-n1vBkyzUvd9CA98rDE/s1600/step2.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="282" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgGWrSN3Bcj3pJK4psmnA4CJ0lYKrwbcum3HeemnU5AAKvo1Zge75TfKl88KF9t-HprHqymhRtOelFMrI8QgUAiGTBxCqj5o0eULsm-3H59M5sZ9FFVKqdzziDb-n1vBkyzUvd9CA98rDE/s640/step2.png" width="640" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<h3>
<span style="color: #cc0000;"><u><b>Step 3</b></u></span></h3>
Set the group by field and aggregate type.<br />
<br />
Category is used as the group by field here for demo purposes. Sum, maximum, minimum, first value and last value selected as aggregate types.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhpfcGIuVEFCIWy2xye345-rkPy4UPUW0jq9O0xG6febsa_JUW-c-lFPVbnJfz3JW8YnqEFG2Q0wdOtH4Zgq2x27ZNBxCrMkBjffPFXichpG2sVZuyhBTENTKrAeprDaF4MelLVbFRQv2g/s1600/step3.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="402" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhpfcGIuVEFCIWy2xye345-rkPy4UPUW0jq9O0xG6febsa_JUW-c-lFPVbnJfz3JW8YnqEFG2Q0wdOtH4Zgq2x27ZNBxCrMkBjffPFXichpG2sVZuyhBTENTKrAeprDaF4MelLVbFRQv2g/s640/step3.png" width="640" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<h3>
<span style="color: #cc0000;"><u><b>Step 4</b></u></span></h3>
Drag the "Dummy" step into the canvas to complete the flow.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjYtSnLt-n_55nQxGs3m4ApbSzzUoyd_lRDnl8RqvS9ANt1NjTGbUfUA1jtdyL1jHFKEbhjlAz1N61Kl7VJsv_LTg2zJk3qDi4rNHu2RysLi6i_7CZqVmLSVEGuns8iHfaotE48M78eH40/s1600/step+4.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="104" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjYtSnLt-n_55nQxGs3m4ApbSzzUoyd_lRDnl8RqvS9ANt1NjTGbUfUA1jtdyL1jHFKEbhjlAz1N61Kl7VJsv_LTg2zJk3qDi4rNHu2RysLi6i_7CZqVmLSVEGuns8iHfaotE48M78eH40/s640/step+4.png" width="640" /></a></div>
<br />
<br />
<br />
<span style="font-size: 16px;"><u><b>Execution Log </b></u></span><i><span style="color: #cc0000; font-size: 12px;"><br /></span></i><br />
<br />
<i><span style="color: #cc0000; font-size: 12px;">trf_Demo_Aggregates - Transformation is pre-loaded.<br />Spoon - Transformation opened.<br />Spoon - Launching transformation [trf_Demo_Aggregates]...<br />Spoon - Started the transformation execution.<br />trf_Demo_Aggregates - Dispatching started for transformation [trf_Demo_Aggregates]<br />trf_Demo_Aggregates - Allocating Steps & StepData...<br />trf_Demo_Aggregates - Transformation is about to allocate step [CSV file input] of type [CsvInput]<br />CSV file input.0 - distribution activated<br />trf_Demo_Aggregates - Transformation is about to allocate step [Sort rows] of type [SortRows]<br />Sort rows.0 - distribution activated<br />trf_Demo_Aggregates - Transformation is about to allocate step [Group by] of type [GroupBy]<br />Group by.0 - distribution activated<br />Group by.0 - Starting allocation of buffers & new threads...<br />Sort rows.0 - The number of binary string to data type conversions done in this sort block is 0<br />Sort rows.0 - Finished processing (I=0, O=0, R=10000, W=10000, U=0, E=0)<br />Group by.0 - Finished processing (I=0, O=0, R=10000, W=26, U=0, E=0)<br />Dummy (do nothing).0 - Finished processing (I=0, O=0, R=26, W=26, U=0, E=0)<br />trf_Demo_Aggregates - Handling extension point for plugin with id 'DataRefineryTransFinishListener' and extension point id 'TransformationFinish'<br />Spoon - The transformation has finished!!</span></i> <br />
<br />
<br />
<br />
<h3>
<u><b>Include All rows - Disabled</b></u></h3>
Results preview where <span style="background-color: yellow;">"Include all rows" option is disabled</span>. Only aggregated rows will be there in the output, in this case one row per category since group by was done based on category.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgTtvO0T3vdjOwlY8nBlCcrw8rvBzah-hVw88XoHMx3tg3R3xHmGJzH2QMeyFsJ4QWTxF58fKzhYLzQN_bMtMQ7DbfeuzSthfEzZS8s2Ytt0vNyzNvpypolauDQC5r8QgAJ_3mKov242VU/s1600/step5.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="468" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgTtvO0T3vdjOwlY8nBlCcrw8rvBzah-hVw88XoHMx3tg3R3xHmGJzH2QMeyFsJ4QWTxF58fKzhYLzQN_bMtMQ7DbfeuzSthfEzZS8s2Ytt0vNyzNvpypolauDQC5r8QgAJ_3mKov242VU/s640/step5.png" width="640" /></a></div>
<br />
<h3>
</h3>
<h3>
<u><b> </b></u></h3>
<h3>
<u><b>Include All rows - Enabled</b></u></h3>
Results preview where <span style="background-color: yellow;">"Include all rows" option is enabled</span>. All input rows will be there in the output, aggregated values such as sum_price, max_price e.t.c will be repeated for the same category in the output.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi00G60dcE3tMJdZrpEcvb6k4l_uk0KsX4qJzd4IDlTJ_OlU4pv-O1KGOUu5v1tERzYjPcU06pNJBAz0EnNeZd9Xm149qG57NoYOEf4lLvrUuHWz551AUTAIbXDROFGzgCnT8OLhDKY-XY/s1600/step6.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="326" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi00G60dcE3tMJdZrpEcvb6k4l_uk0KsX4qJzd4IDlTJ_OlU4pv-O1KGOUu5v1tERzYjPcU06pNJBAz0EnNeZd9Xm149qG57NoYOEf4lLvrUuHWz551AUTAIbXDROFGzgCnT8OLhDKY-XY/s640/step6.png" width="640" /></a></div>
<br />
<br />
<br />
<h3>
<u><b> </b></u></h3>
<h3>
<u><b>Include All rows - Enabled with Line number</b></u></h3>
Results preview where <span style="background-color: yellow;">"Include all rows" option is enabled and "Add line number" is enabled</span>. All rows will be there in the output with a line number for each row.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjMZ_6ZDnT6aRfrqt_jvIbrQfLmx4AST3gHrlu2HeTla5HvNcC6wMdL9jdJieVxsL-rNwbFdWXI5EH0jaUGIdbdBBPhcslxj5-Pz-GPtsuBmUriiqFq8UEmIt09B7peSCQr5Nywu-9ew9Y/s1600/step7.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="288" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjMZ_6ZDnT6aRfrqt_jvIbrQfLmx4AST3gHrlu2HeTla5HvNcC6wMdL9jdJieVxsL-rNwbFdWXI5EH0jaUGIdbdBBPhcslxj5-Pz-GPtsuBmUriiqFq8UEmIt09B7peSCQr5Nywu-9ew9Y/s640/step7.png" width="640" /></a></div>
<br />
<br />
<br />
<h3>
<u><b> </b></u></h3>
<h3>
<u><b>Always give back a result row</b></u></h3>
Add a filter step to the dummy transformation to filter out all input records.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiAdj3TEADbrxLa6QRBxpIbBoGVqwCRItNaZsUtullXT_A6uv6J_esb0uDW-XXVfBQ2oPoxKL-YSq1no9Qai8apA20DD7WPeRYEPdyhvOmPRxpvIOFniiIq1Ftjg4TnVTPE0fGBEm8bZyE/s1600/step8.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="148" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiAdj3TEADbrxLa6QRBxpIbBoGVqwCRItNaZsUtullXT_A6uv6J_esb0uDW-XXVfBQ2oPoxKL-YSq1no9Qai8apA20DD7WPeRYEPdyhvOmPRxpvIOFniiIq1Ftjg4TnVTPE0fGBEm8bZyE/s640/step8.png" width="640" /></a></div>
There will be no results rows if the "Always give back a result row" is disabled. Need to enable this to get output result row with Counts as 0 and other aggregate values as Null.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjcgURZCctjbOkVymL9913YkgbN35IOAsA_nY8XpIjiRTS5fH4xbzTd-zE1cpXVkHlNKTRL4Rvef9j0Wjs4Sx9YSaqy1jGq9enbw_SnDXV-ooz4UVXNKrDXztaZItUVTi5iKqgu6lwXfJc/s1600/step9.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjcgURZCctjbOkVymL9913YkgbN35IOAsA_nY8XpIjiRTS5fH4xbzTd-zE1cpXVkHlNKTRL4Rvef9j0Wjs4Sx9YSaqy1jGq9enbw_SnDXV-ooz4UVXNKrDXztaZItUVTi5iKqgu6lwXfJc/s1600/step9.png" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<span id="goog_1859998616"></span><span id="goog_1859998617"></span> Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com15tag:blogger.com,1999:blog-7135267265798115356.post-8819899047409297012015-09-19T22:09:00.002-07:002015-09-19T22:13:19.720-07:00Pentaho Data Integration - Data Grid Input stepThis step generally used for testing, reference or demo purposes. We can create a static rows in a grid.<br />
<br />
<ul>
<li><span style="color: blue;">Meta tab : Enter field names and meta data info.</span></li>
<li><span style="color: blue;">Data tab : Enter static data in a grid.</span></li>
</ul>
<br />
<span style="color: blue;"><span style="color: black;">Here are the step wise illustrations on how to use Data Grid step.</span></span><br />
<a name='more'></a><br />
<span style="color: blue;"><span style="color: black;"><span style="color: #cc0000;"><u><b>Step 1</b></u></span> </span></span><br />
<br />
<span style="color: blue;"><span style="color: black;"><span style="color: blue;"><span style="color: black;">Create a demo transformation and drag "Data Grid" input step into the canvas.</span></span></span></span><br />
<div class="separator" style="clear: both; text-align: center;">
</div>
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhOHJZi2mHJsEbtM-5bjh6Qxej9g1gy3eaiUHq51x6Yh3mWb_iV2dK1SxSXPO-xqbaK9Z-E5LU0-fuWHw0RWHiaVGivVUBNvIOwg_-BdstkIrSAKiRSNFf-N9gdCJtVn1MMlLIRCdhVjRs/s1600/step1.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="288" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhOHJZi2mHJsEbtM-5bjh6Qxej9g1gy3eaiUHq51x6Yh3mWb_iV2dK1SxSXPO-xqbaK9Z-E5LU0-fuWHw0RWHiaVGivVUBNvIOwg_-BdstkIrSAKiRSNFf-N9gdCJtVn1MMlLIRCdhVjRs/s640/step1.jpg" width="640" /></a></div>
<br />
<div class="separator" style="clear: both; text-align: center;">
</div>
<span style="color: blue;"><span style="color: black;"><span style="color: #cc0000;"><u><b>Step 2</b></u></span></span></span><br />
<br />
<span style="color: blue;"><span style="color: black;"><span style="color: blue;"><span style="color: black;">Meta data information - Enter name, type, format, precision e.t.c.</span></span></span></span><br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgoqtP5AkUmKR5vekZahkavWqqEIUfVBfEK-bAdxfQk0q5dfgQ8H9lI76STGwdC67FmGv5cXPDRG1hNKonHLxUnD3Xd2FDmbL0q1vPa7XZQYuoIoMih5UtMdlFUOVWlYw-UmLYUeartXXY/s1600/step2.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="256" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgoqtP5AkUmKR5vekZahkavWqqEIUfVBfEK-bAdxfQk0q5dfgQ8H9lI76STGwdC67FmGv5cXPDRG1hNKonHLxUnD3Xd2FDmbL0q1vPa7XZQYuoIoMih5UtMdlFUOVWlYw-UmLYUeartXXY/s640/step2.jpg" width="640" /></a></div>
<br />
<span style="color: blue;"><span style="color: black;"><span style="color: #cc0000;"><u><b>Step 3</b></u></span> </span></span><br />
<br />
<span style="color: blue;"><span style="color: black;"><span style="color: blue;"><span style="color: black;">Click on data tab - Enter static rows into the grid.</span></span></span></span><br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgQ0asMzo9-oxP2_4hnlsOKOgfQQ_m0eaFY6PcmZ8U4OIpqA83_zXfYkUOy_pbWJHaD1rjZWpXAUdWUbmU2H82goMcEocJ6WYz0eGUMzMVoQthMAiKOSaFgHWohvQmxSwYaGM0RcpuRJ3I/s1600/step3.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="202" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgQ0asMzo9-oxP2_4hnlsOKOgfQQ_m0eaFY6PcmZ8U4OIpqA83_zXfYkUOy_pbWJHaD1rjZWpXAUdWUbmU2H82goMcEocJ6WYz0eGUMzMVoQthMAiKOSaFgHWohvQmxSwYaGM0RcpuRJ3I/s640/step3.jpg" width="640" /></a></div>
<br />
<span style="color: blue;"><span style="color: black;"><span style="color: #cc0000;"><u><b>Step 4</b></u></span> </span></span><br />
<span style="color: blue;"><span style="color: black;"><span style="color: blue;"><span style="color: black;">Drag a dummy step and complete the transformation flow ( for demo purpose ) .</span></span></span></span><br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgi4z0GQl_PELqX2pQmQ7w6XHIk4UBzmH2wDJdLT7-Z_uKqGp3ULz5htBhwf9Qn2Sek_9GcEr6blQ-G3OQg09A1ABeYbdQmeNSr6rekqKPwoZEHPva67BmWTmfvkEB9KYwHGZftzu60u_g/s1600/step+4.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="272" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgi4z0GQl_PELqX2pQmQ7w6XHIk4UBzmH2wDJdLT7-Z_uKqGp3ULz5htBhwf9Qn2Sek_9GcEr6blQ-G3OQg09A1ABeYbdQmeNSr6rekqKPwoZEHPva67BmWTmfvkEB9KYwHGZftzu60u_g/s640/step+4.jpg" width="640" /></a></div>
<br />
<span style="color: blue;"><span style="color: black;"><span style="color: #cc0000;"><u><b>Step 5</b></u></span> </span></span><br />
Execution results.<br />
<br />
<u><i><b>Preview Data</b></i></u><br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhIrd5dYdFmOsgXEtSDRky8i_cLfYH_75aZt-UZIXCnlWak4XnpaSbtPwOWbahulpg8bwJtuvsgGRf-hsWdbGi4t5DDAij3qHzDH3dyhRIYH9e_mygVYi8I7Uk54TqcC7x0t7mDZ-sxeho/s1600/step5.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="288" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhIrd5dYdFmOsgXEtSDRky8i_cLfYH_75aZt-UZIXCnlWak4XnpaSbtPwOWbahulpg8bwJtuvsgGRf-hsWdbGi4t5DDAij3qHzDH3dyhRIYH9e_mygVYi8I7Uk54TqcC7x0t7mDZ-sxeho/s640/step5.jpg" width="640" /></a></div>
<br />
<br />
<span style="color: blue;"><span style="color: black;"><span style="color: blue;"><span style="color: black;"><u><i><b>Execution Log</b></i></u></span></span></span></span><br />
<br />
<span style="color: blue; font-size: 12px;"><span style="color: black;"><span style="color: blue;"><span style="color: black;"><i><span style="color: blue;">trf_Demo_Load_DataGrid_File - Transformation is pre-loaded.<br />Spoon - Transformation opened.<br />Spoon - Launching transformation [trf_Demo_Load_DataGrid_File]...<br />Spoon - Started the transformation execution.<br />trf_Demo_Load_DataGrid_File - Dispatching started for transformation [trf_Demo_Load_DataGrid_File]<br />trf_Demo_Load_DataGrid_File - Nr of arguments detected:0<br />trf_Demo_Load_DataGrid_File - This is not a replay transformation<br />trf_Demo_Load_DataGrid_File - I found 2 different steps to launch.<br />trf_Demo_Load_DataGrid_File - Allocating rowsets...<br />trf_Demo_Load_DataGrid_File - Allocating rowsets for step 0 --> Data Grid<br />trf_Demo_Load_DataGrid_File - prevcopies = 1, nextcopies=1<br />trf_Demo_Load_DataGrid_File - Transformation allocated new rowset [Data Grid.0 - Dummy (do nothing).0]<br />trf_Demo_Load_DataGrid_File - Allocated 1 rowsets for step 0 --> Data Grid<br />trf_Demo_Load_DataGrid_File - Allocating rowsets for step 1 --> Dummy (do nothing)<br />trf_Demo_Load_DataGrid_File - Allocated 1 rowsets for step 1 --> Dummy (do nothing)<br />trf_Demo_Load_DataGrid_File - Allocating Steps & StepData...<br />trf_Demo_Load_DataGrid_File - Transformation is about to allocate step [Data Grid] of type [DataGrid]<br />Data Grid.0 - distribution activated<br />Data Grid.0 - Starting allocation of buffers & new threads...<br />Data Grid.0 - Step info: nrinput=0 nroutput=1<br />Data Grid.0 - output rel. is 1:1<br />Data Grid.0 - Found output rowset [Data Grid.0 - Dummy (do nothing).0]<br />Data Grid.0 - Finished dispatching<br />trf_Demo_Load_DataGrid_File - Transformation has allocated a new step: [Data Grid].0<br />trf_Demo_Load_DataGrid_File - Transformation is about to allocate step [Dummy (do nothing)] of type [Dummy]<br />Dummy (do nothing).0 - distribution activated<br />Dummy (do nothing).0 - Starting allocation of buffers & new threads...<br />Dummy (do nothing).0 - Step info: nrinput=1 nroutput=0<br />Dummy (do nothing).0 - Got previous step from [Dummy (do nothing)] #0 --> Data Grid<br />Dummy (do nothing).0 - input rel is 1:1<br />Dummy (do nothing).0 - Found input rowset [Data Grid.0 - Dummy (do nothing).0]<br />Dummy (do nothing).0 - Finished dispatching<br />trf_Demo_Load_DataGrid_File - Transformation has allocated a new step: [Dummy (do nothing)].0<br />trf_Demo_Load_DataGrid_File - This transformation can be replayed with replay date: 2015/09/19 21:35:20<br />trf_Demo_Load_DataGrid_File - Initialising 2 steps...<br />Data Grid.0 - Released server socket on port 0<br />Dummy (do nothing).0 - Released server socket on port 0<br />trf_Demo_Load_DataGrid_File - Step [Data Grid.0] initialized flawlessly.<br />trf_Demo_Load_DataGrid_File - Step [Dummy (do nothing).0] initialized flawlessly.<br />trf_Demo_Load_DataGrid_File - Transformation has allocated 2 threads and 1 rowsets.<br />Data Grid.0 - Starting to run...<br />Dummy (do nothing).0 - Starting to run...<br />Data Grid.0 - Finished processing (I=0, O=0, R=0, W=5, U=0, E=0)<br />Dummy (do nothing).0 - Finished processing (I=0, O=0, R=5, W=5, U=0, E=0)<br />trf_Demo_Load_DataGrid_File - Handling extension point for plugin with id 'DataRefineryTransFinishListener' and extension point id 'TransformationFinish'<br />Spoon - The transformation has finished!!</span></i><br /> </span></span></span></span>Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com1tag:blogger.com,1999:blog-7135267265798115356.post-8908592967223134672015-09-19T20:12:00.000-07:002015-09-19T20:12:39.485-07:00Pentaho Common Errors : Error converting data while looking up value<b><u><span style="background-color: orange;">Error Message </span></u></b><br />
<br />
<span style="font-size: x-small;">Stream lookup.0 - ERROR (version 5.4.0.1-130, build 1 from 2015-06-14_12-34-55 by buildguy) : Unexpected error<br />Stream lookup.0 - ERROR (version 5.4.0.1-130, build 1 from 2015-06-14_12-34-55 by buildguy) : org.pentaho.di.core.exception.KettleStepException: <br /><span style="color: red;"><b>Stream lookup.0 - Error converting data while looking up value</b></span><br />Stream lookup.0 - </span><br />
<a name='more'></a><span style="font-size: x-small;"><br />Stream lookup.0 - Unexpected conversion error while converting value [prod_id String(15)<binary-string>] to an Integer<br />Stream lookup.0 - java.lang.String cannot be cast to [B<br />Stream lookup.0 - <br />Stream lookup.0 - at org.pentaho.di.trans.steps.streamlookup.StreamLookup.lookupValues(StreamLookup.java:267)<br />Stream lookup.0 - at org.pentaho.di.trans.steps.streamlookup.StreamLookup.processRow(StreamLookup.java:446)<br />Stream lookup.0 - at org.pentaho.di.trans.step.RunThread.run(RunThread.java:62)<br />Stream lookup.0 - at java.lang.Thread.run(Unknown Source)<br />Stream lookup.0 - Caused by: org.pentaho.di.core.exception.KettleValueException: <br />Stream lookup.0 - Unexpected conversion error while converting value [prod_id String(15)<binary-string>] to an Integer<br />Stream lookup.0 - java.lang.String cannot be cast to [B<br />Stream lookup.0 - <br />Stream lookup.0 - at org.pentaho.di.core.row.value.ValueMetaBase.getInteger(ValueMetaBase.java:1783)<br />Stream lookup.0 - at org.pentaho.di.core.row.value.ValueMetaBase.convertDataCompatible(ValueMetaBase.java:3581)<br />Stream lookup.0 - at org.pentaho.di.trans.steps.streamlookup.StreamLookup.lookupValues(StreamLookup.java:265)<br />Stream lookup.0 - ... 3 more<br />Stream lookup.0 - Caused by: java.lang.ClassCastException: java.lang.String cannot be cast to [B<br />Stream lookup.0 - at org.pentaho.di.core.row.value.ValueMetaBase.getInteger(ValueMetaBase.java:1723)<br />Stream lookup.0 - ... 5 more<br />Stream lookup.0 - Finished processing (I=0, O=0, R=10001, W=0, U=0, E=1)<br />trf_Demo_Load_CSV_File - Transformation detected one or more steps with errors.<br />trf_Demo_Load_CSV_File - Transformation is killing the other steps!</span><br />
<br />
<b><u><span style="background-color: lime;"><span style="background-color: orange;">Root Cause</span></span></u></b><br />
<br />
Stream lookup - lookup value match error.<br />
Data type not matching for keys to lookup.<br />
<br />
<u><b>Example :</b></u><br />
Data types for prod_id and id not matching.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj48dKCnyiwH7j1yLnA3kP0zQCzvxHLS-r2UZ8qf4VEX7kTamncCGZWaSlGyaU-qp1gcxHeDIgUX4-7rJDFmQN_gZPgnWn8NVsTcxYdGecHu6VOS4-joe4bP01Isqn3rAhsBiSerKIONs8/s1600/step+1.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="228" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj48dKCnyiwH7j1yLnA3kP0zQCzvxHLS-r2UZ8qf4VEX7kTamncCGZWaSlGyaU-qp1gcxHeDIgUX4-7rJDFmQN_gZPgnWn8NVsTcxYdGecHu6VOS4-joe4bP01Isqn3rAhsBiSerKIONs8/s640/step+1.jpg" width="640" /></a></div>
<br />
<br />
<b><u><span style="background-color: lime;">Resolution</span></u></b><br />
<br />
Data types for prod_id and id should be matching.Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com2tag:blogger.com,1999:blog-7135267265798115356.post-58921756791338967992015-09-19T17:36:00.000-07:002015-09-19T18:32:03.840-07:00Pentaho Data Integration - CSV File Input with parallel execution enabledCSV file input is a commonly used input step to read delimited files. Options are similar to text file input steps. Here are the general configurable options.<br />
<br />
<ul>
</ul>
<ol>
<li>File name<i><span style="color: #cc0000;"> - Input file name.</span></i></li>
<li>Delimiter<i><span style="color: #cc0000;"> - Support common delimiters like coma, tab, pipe e.t.c</span></i></li>
<li>Enclosure<i><span style="color: #cc0000;"> - Optional enclosures like double quotes.</span></i></li>
<li>NIO buffer size<i><span style="color: #cc0000;"> - Read buffer size.</span></i></li>
<li><span style="color: #cc0000;"><span style="color: black;">Lazy Conversion</span></span><i><span style="color: #cc0000;"> - Significant performance improvement by avoiding data type conversions. Check this option only if the logic is mere pass through.</span></i><a name='more'></a> </li>
<li><span style="color: #cc0000;"><span style="color: black;">Header row present</span></span><i><span style="color: #cc0000;"> - Enable in case the file has header row.</span></i></li>
<li><span style="color: #cc0000;"><span style="color: black;">Add file name to result</span></span><i><span style="color: #cc0000;"> - File name or file names will be added to the result in memory and can be passed into the next job entry.</span></i></li>
<li><span style="color: #cc0000;"><span style="color: black;">Running in Parallel</span></span><i><span style="color: #cc0000;"> - This step can run in multiple copies and will read separate blocks of the file. Configure along with clustered or copy mode.</span></i></li>
<li><span style="color: #cc0000;"><span style="color: black;">File encoding </span></span><i><span style="color: #cc0000;">- UTF-8, ASCII e.t.c.</span></i></li>
</ol>
<ul>
</ul>
<br />
Here are step wise illustrations on how to use CSV input step.<br />
<br />
<span style="color: blue; font-size: 26px;"><u><b>Without Parallel Execution Option</b></u></span><br />
<br />
<span style="color: #cc0000; font-size: 18px;"><u><b>Step 1 </b></u></span><br />
Create a new transformation for demo and drag the CSV input file into the canvas.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhIMVeNOCk5PITOOH4CIJIF3yzHXMLgk8x8GBCuz-DlbfxAj1_pq2MHKUFmLSSHINpTZXZhbEQcSBiHeMgJ-Sy0PTLx2trO2cHW6WH6Xn2qtzpAe6u9JGTM-xeaUpdAz8tlVJL1rhuL2ns/s1600/step1.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="331" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhIMVeNOCk5PITOOH4CIJIF3yzHXMLgk8x8GBCuz-DlbfxAj1_pq2MHKUFmLSSHINpTZXZhbEQcSBiHeMgJ-Sy0PTLx2trO2cHW6WH6Xn2qtzpAe6u9JGTM-xeaUpdAz8tlVJL1rhuL2ns/s640/step1.jpg" width="640" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<span style="color: #cc0000; font-size: 18px;"><u><b>Step 2</b></u></span><br />
Double click on the CSV input step and configure options.<br />
<br />
<ul>
<li>File name : Browse and select from local server.</li>
<li>Delimiter : Coma for demo purposes.</li>
<li>Lazy conversion : Disabled.</li>
<li>Running in parallel : Disabled.</li>
<li>File encoding : UTF-8 selected for demo.</li>
</ul>
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi1Aj1lpzge-NNC8Z6cKxPlRfNrlRvY1dOB7V2XRObfySJMfG-XN9pqc0PL-6KQx_O6lzbFodMd4x5O_-CUYVWrA4rdQgA_nwgcIhsa1htgmoe7pzv7clMkjGa5UPBXFDoB2KCMG1zNXNM/s1600/step2.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="313" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi1Aj1lpzge-NNC8Z6cKxPlRfNrlRvY1dOB7V2XRObfySJMfG-XN9pqc0PL-6KQx_O6lzbFodMd4x5O_-CUYVWrA4rdQgA_nwgcIhsa1htgmoe7pzv7clMkjGa5UPBXFDoB2KCMG1zNXNM/s640/step2.jpg" width="640" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<span style="color: #cc0000; font-size: 18px;"><u><b>Step 3</b></u></span><br />
Click on "Get Fields" to populate field names based on sample data.<br />
Configure appropriate trim type.<br />
<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg4jv-u95Bqz5_2WnRY0MSguHjvsEO_YWN5BnWTEC8t_NhupNBbVtAVTVTc8pupWKGyUVVQvwovR0tpzcjrmG7VgpzAFJJ0zYbospyjaJ6smQ5Twnuc49o_NiVdQ9ZjGN3_aSGMeYvJ85s/s1600/step3.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="412" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg4jv-u95Bqz5_2WnRY0MSguHjvsEO_YWN5BnWTEC8t_NhupNBbVtAVTVTc8pupWKGyUVVQvwovR0tpzcjrmG7VgpzAFJJ0zYbospyjaJ6smQ5Twnuc49o_NiVdQ9ZjGN3_aSGMeYvJ85s/s640/step3.jpg" width="640" /></a>Here are the results of document scan.<span style="color: blue;"><b><i><span style="font-size: 12px;"> </span></i></b></span><br />
<span style="color: blue;"><b><i><span style="font-size: 12px;">Result after scanning 100 lines.</span></i></b></span><br />
<span style="color: blue;"><b><i><span style="font-size: 12px;">---------------------------------------</span></i></b></span><br />
<span style="color: blue; font-size: 12px;"><i><u>Field nr. 1 </u>:<br /> Field name : id Field type : Integer<br /><u>Field nr. 2</u> :<br /> Field name : code Field type : String Maximum length : 10 Minimum value : PRD_CD_ACF Maximum value : PRD_CD_ZWQ Nr of null values : 1<br /><u>Field nr. 3</u> :<br /> Field name : description Field type : String Maximum length : 13 Minimum value : aafwtscm-desc Maximum value : zyirgmfn-desc Nr of null values : 1<br /><u>Field nr. 4</u> :<br /> Field name : category Field type : String Maximum length : 1 Minimum value : A Maximum value : Z Nr of null values : 1<br /><u>Field nr. 5 </u>:<br /> Field name : price Field type : Number Estimated length : 15 Estimated precision : 5 Number format : #.# WARNING: More then 1 number format seems to match all sampled records: Number format : #.# Trim Type : 0 Minimum value : 252.44767Maximum value : 1246.517 Example : #.#, number [252.44767] gives 252.44767</i></span><br />
<br />
<span style="color: #cc0000; font-size: 18px;"><u><b>Step 4</b></u></span><br />
Add a dummy step to complete the flow.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgcpk5IVLkSdEnAryygumal2ihATia8QYriJdwTVojimCx1NwoBHH_GaUcmYxQXiHXFKar2h0s_Zc3gJbsI6UNVhGgercP9Y0GrBjRnN4DdbQ9l-g7tt_bFFsnNMmjoAHyYGdoMuzkZ0lk/s1600/step4.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="353" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgcpk5IVLkSdEnAryygumal2ihATia8QYriJdwTVojimCx1NwoBHH_GaUcmYxQXiHXFKar2h0s_Zc3gJbsI6UNVhGgercP9Y0GrBjRnN4DdbQ9l-g7tt_bFFsnNMmjoAHyYGdoMuzkZ0lk/s640/step4.jpg" width="640" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<span style="color: #cc0000; font-size: 18px;"><u><b>Step 5</b></u></span><br />
Here are the execution results for the demo transformation.<br />
<i><u><b>Step Metrics</b></u></i><br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiiZX40AEwVBY1BPtJ2uwB13vBVWntwAlpWVOIPb9-eRsFdTAIZ3odp4ZxCcrL55NdjyNMh2sbG2ZrUf-6sAvDz0pEFPFQgN_kO7PrFxlwvPAQrtZo0dhPSkQTbaz4iqUqd-LAShQuvDQ0/s1600/step5.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="243" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiiZX40AEwVBY1BPtJ2uwB13vBVWntwAlpWVOIPb9-eRsFdTAIZ3odp4ZxCcrL55NdjyNMh2sbG2ZrUf-6sAvDz0pEFPFQgN_kO7PrFxlwvPAQrtZo0dhPSkQTbaz4iqUqd-LAShQuvDQ0/s640/step5.jpg" width="640" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<i><u><b>Preview Data</b></u></i><br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgIUz8Gep2P1j__pfWhEf31ja6CH72XhTdPya5BgI2NH7ff6t9EiTmO0Gq1HrrwMGmsK64T5bhqLvTM_pc82LWaVxxEkPbsanYu6hjSIqnudbHjwHodhH5JRqfQ3GfnuaQaLxp8W5uEcac/s1600/step6.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="251" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgIUz8Gep2P1j__pfWhEf31ja6CH72XhTdPya5BgI2NH7ff6t9EiTmO0Gq1HrrwMGmsK64T5bhqLvTM_pc82LWaVxxEkPbsanYu6hjSIqnudbHjwHodhH5JRqfQ3GfnuaQaLxp8W5uEcac/s400/step6.jpg" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<i><u><b>Execution Log</b></u></i><br />
<span style="color: blue; font-size: 12px;"><i> General - Logging plugin type found with ID: CheckpointLogTable<br /> General - Starting agile-bi<br /> Version checker - OK<br /> class org.pentaho.agilebi.platform.JettyServer - WebServer.Log.CreateListener localhost:10000<br /> Spoon - Transformation opened.<br /> Spoon - Launching transformation [trf_Demo_Load_CSV_File]...<br /> Spoon - Started the transformation execution.<br /> trf_Demo_Load_CSV_File - Dispatching started for transformation [trf_Demo_Load_CSV_File]<br /> CSV file input.0 - Header row skipped in file 'products_data.csv'<br /> CSV file input.0 - Finished processing (I=10001, O=0, R=0, W=10000, U=0, E=0)<br /> Dummy (do nothing).0 - Finished processing (I=0, O=0, R=10000, W=10000, U=0, E=0)<br /> Spoon - The transformation has finished!!</i></span><br />
<br />
<span style="color: blue; font-size: 26px;"><u><b>With Parallel Execution Option</b></u></span><br />
<br />
<span style="color: #cc0000; font-size: 18px;"><u><b>Step 6</b></u></span><br />
Enable "Running in parallel" option.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjKoP50nnBo9UpOgofeKLItV2ILTne5HJruw7r8AuDdETE9n1pt4RNVirUhhBdtQ2bS30e7NL4Jb5eV-Mj9Lpk8TGi_xxPiIycOhlKsNz7LVE4psX9-52Ko6HrbL52VANbJm0aL7Ud-6sg/s1600/step7.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="305" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjKoP50nnBo9UpOgofeKLItV2ILTne5HJruw7r8AuDdETE9n1pt4RNVirUhhBdtQ2bS30e7NL4Jb5eV-Mj9Lpk8TGi_xxPiIycOhlKsNz7LVE4psX9-52Ko6HrbL52VANbJm0aL7Ud-6sg/s640/step7.jpg" width="640" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<span style="color: #cc0000; font-size: 18px;"><u><b>Step 7</b></u></span> </div>
Change number of copies to start with from 1 to say 5 for demo purpose.<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEilAKyvb87P02Qc-VWm8_nQiBfv6CanzS5Kg37xWL_bGgOFmXhCyKX1xz3pK_PSIm_EU9mXicKl4H6PTVEAtp4q13b5s5v5p-hiWG08ZJH3vGi746GKbY85w2P4IYPUfghc-PxWL0zj9zs/s1600/step8.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="440" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEilAKyvb87P02Qc-VWm8_nQiBfv6CanzS5Kg37xWL_bGgOFmXhCyKX1xz3pK_PSIm_EU9mXicKl4H6PTVEAtp4q13b5s5v5p-hiWG08ZJH3vGi746GKbY85w2P4IYPUfghc-PxWL0zj9zs/s640/step8.jpg" width="640" /></a><span style="color: #cc0000; font-size: 18px;"><u><b> </b></u></span><br />
<span style="color: #cc0000; font-size: 18px;"><u><b>Step 8</b></u></span><br />
Select data movement as "Round robin".<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgBoxZ1N7ZNEMar3jS5PEZ0poRstbvTdkAhBsxYGcihP6ymSPQpT7ReFt6MetW6z2n00xGDgxmvYJBq6TRDVx2MZZ_4zPOVjUp1fWUGGEOPHBCqNvzVlUqfUPGfZ5zEKVijRGMedaJHTQs/s1600/step10.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="291" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgBoxZ1N7ZNEMar3jS5PEZ0poRstbvTdkAhBsxYGcihP6ymSPQpT7ReFt6MetW6z2n00xGDgxmvYJBq6TRDVx2MZZ_4zPOVjUp1fWUGGEOPHBCqNvzVlUqfUPGfZ5zEKVijRGMedaJHTQs/s640/step10.jpg" width="640" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<span style="color: #cc0000; font-size: 18px;"><u><b>Step 9</b></u></span><br />
Execute the demo transformation. 5 parallel copies of the input step got executed, each reading a different block of the input file.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg6HdluWufbD0X22lgeav62dlMKVTOLNzMjgpOKZ8-4PsGNw-N-cj1RqFHDtfx-omYeqy4GKDdyLnAlcrcRWaVX-dTcO_ez8WxoAxsN4sS6ILe9gTwOZt3QcpYY9PV7lrYY2WiCr8Zc9ug/s1600/step11.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="227" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg6HdluWufbD0X22lgeav62dlMKVTOLNzMjgpOKZ8-4PsGNw-N-cj1RqFHDtfx-omYeqy4GKDdyLnAlcrcRWaVX-dTcO_ez8WxoAxsN4sS6ILe9gTwOZt3QcpYY9PV7lrYY2WiCr8Zc9ug/s640/step11.jpg" width="640" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<span style="color: #cc0000; font-size: 18px;"><u><b>Step 10</b></u></span><br />
<i><u><b>Execution Log</b></u></i><span style="color: blue; font-size: 12px;"><i><br /> Spoon - Transformation opened.<br /> Spoon - Launching transformation [trf_Demo_Load_CSV_File]...<br /> Spoon - Started the transformation execution.<br /> trf_Demo_Load_CSV_File - Dispatching started for transformation [trf_Demo_Load_CSV_File]<br /> CSV file input.0 - We start to read from file 'products_data.csv' (430326 bytes). Then we skip 0 bytes to the starting position and read 86065 bytes.<br /> CSV file input.1 - We start to read from file 'products_data.csv' (430326 bytes). Then we skip 86065 bytes to the starting position and read 86065 bytes.<br /> CSV file input.0 - Header row skipped in file 'products_data.csv'<br /> CSV file input.4 - We start to read from file 'products_data.csv' (430326 bytes). Then we skip 344260 bytes to the starting position and read 86065 bytes.<br /> CSV file input.1 - Finished processing (I=1995, O=0, R=0, W=1995, U=0, E=0)<br /> CSV file input.0 - Finished processing (I=2022, O=0, R=0, W=2021, U=0, E=0)<br /> CSV file input.2 - We start to read from file 'products_data.csv' (430326 bytes). Then we skip 172130 bytes to the starting position and read 86065 bytes.<br /> CSV file input.3 - We start to read from file 'products_data.csv' (430326 bytes). Then we skip 258195 bytes to the starting position and read 86065 bytes.<br /> CSV file input.4 - Finished processing (I=1995, O=0, R=0, W=1995, U=0, E=0)<br /> CSV file input.2 - Finished processing (I=1993, O=0, R=0, W=1993, U=0, E=0)<br /> CSV file input.3 - Finished processing (I=1996, O=0, R=0, W=1996, U=0, E=0)<br /> Dummy (do nothing).0 - Finished processing (I=0, O=0, R=10000, W=10000, U=0, E=0)<br /> Spoon - The transformation has finished!!</i></span>Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com2tag:blogger.com,1999:blog-7135267265798115356.post-45757210259063962032015-08-30T23:38:00.001-07:002015-08-30T23:40:36.907-07:00Pentaho Data Integration - PDI 5.4 Installation for Windows 64 bitPentaho 5.4 is the latest Pentaho version with powerful features. <br />
Stepwise illustration on how to install Pentaho Data Integration 5.4 is given below.<br />
<br />
Here are some of the highlights of the new version.<br />
<br />
<a name='more'></a><br />
<ul>
<li><span style="color: blue;">Streamlined Data Refinery</span></li>
<li><span style="color: blue;">Analyzer APIs and documentation.</span></li>
<li><span style="color: blue;">Improvements to the Pentaho Operations Mart. </span></li>
<li><span style="color: blue;">New scheduling PDI APIs.</span></li>
<li><span style="color: blue;">POST methods for Carte. </span></li>
<li><span style="color: blue;">New support for SAP HANA, Sqoop, and Spark.</span></li>
<li><span style="color: blue;">Clustering improvements for Hadoop.</span></li>
<li><span style="color: blue;">Minor functionality improvements for PDI steps.</span></li>
</ul>
<strong><u></u></strong><br />
<strong><u>Step 1</u></strong><br />
<strong><u></u></strong><br />
Download PDI 5.4 installation software "pdi-5.4.0.1-130-x64.exe" from Pentaho site.<br />
<br />
<a href="http://www.pentaho.com/download">http://www.pentaho.com/download</a><br />
<br />
Execute the installation file.<br />
<br />
<strong><u>Step 2</u></strong><br />
<strong><u></u></strong><br />
Pentaho installation wizard will popup and will guide through the installation.<br />
Click Next.<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgPGnjGVauXOM02cQsZ3Ry7uqiUYgMmHnLZ9sCI2tw6cE8zfzwTpMDJxDNrnVQ1MF5pI318yciqck2kmxEoPz0XtuRgpnS3InwM10GzdmCNsHz0HvtdGS32hdXecKRnzK4SK_ZKDhdnQRw/s1600/step1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="291" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgPGnjGVauXOM02cQsZ3Ry7uqiUYgMmHnLZ9sCI2tw6cE8zfzwTpMDJxDNrnVQ1MF5pI318yciqck2kmxEoPz0XtuRgpnS3InwM10GzdmCNsHz0HvtdGS32hdXecKRnzK4SK_ZKDhdnQRw/s400/step1.jpg" width="400" /></a><br />
<br />
<br />
<br />
<strong><u>Step 3</u></strong><br />
<strong><u></u></strong><br />
Accept license agreements.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjYYdmBbCyyaMkGeSF5KMNzaJpDnSieLDFCO8prL6nPAIgTOOMhOSm_f_5gw6b_I5P3eVfgxOY35DCbt1iRmmbwDb77hrRqgYFcEai5lPtlenEBq9FiKfkBGEfuWExkHReKXSKpq0_Hpc0/s1600/step2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="290" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjYYdmBbCyyaMkGeSF5KMNzaJpDnSieLDFCO8prL6nPAIgTOOMhOSm_f_5gw6b_I5P3eVfgxOY35DCbt1iRmmbwDb77hrRqgYFcEai5lPtlenEBq9FiKfkBGEfuWExkHReKXSKpq0_Hpc0/s400/step2.jpg" width="400" /></a></div>
<br />
<br />
<strong><u></u></strong><br />
<strong><u>Step 4</u></strong><br />
<strong><u></u></strong><br />
Specify the location to install the Pentaho applications.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi1dn9hAF4X4W_RKHmgshseHVUnfXcm5qNXO3bXhdShARodXns7ZzpKKc3QIkYcikLTR_SmqNz7TuLjYKsf-NXXW3ykDbj-WH4wCEDWOO5hSIGz7LW9_PudYwTjSTimDVs8sgkFo6DkwTo/s1600/step3.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="290" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi1dn9hAF4X4W_RKHmgshseHVUnfXcm5qNXO3bXhdShARodXns7ZzpKKc3QIkYcikLTR_SmqNz7TuLjYKsf-NXXW3ykDbj-WH4wCEDWOO5hSIGz7LW9_PudYwTjSTimDVs8sgkFo6DkwTo/s400/step3.jpg" width="400" /></a></div>
<br />
<br />
<strong><u>Step 5</u></strong><br />
<strong><u></u></strong><br />
Postgres will be installed along with PDI to store reports and other user information. Please specify password for postgres.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgo__Cw-3TBDHS0sSSO16XfOvenCQem9nLXsjVLiNQ9zVWMGThFuqAe1SJ4ObKZVHg6HmOUZnRdsbuMvxUqvJxuyVtF9qcei-4durRak4sgNlbI1zOiUo-ycyexogp-2zgMW6Dg41jrRDE/s1600/step4.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="292" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgo__Cw-3TBDHS0sSSO16XfOvenCQem9nLXsjVLiNQ9zVWMGThFuqAe1SJ4ObKZVHg6HmOUZnRdsbuMvxUqvJxuyVtF9qcei-4durRak4sgNlbI1zOiUo-ycyexogp-2zgMW6Dg41jrRDE/s400/step4.jpg" width="400" /></a></div>
<br />
<br />
<strong><u>Step 6</u></strong><br />
<strong><u></u></strong><br />
Setup ready to install. Click next to proceed.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjvrqyeFfOX18f38uyl4iwrFVaCdNJ3w2J9vMMziutgihGofQ9-3dgk35mRyCmIjETEtXPR3JYHmWAwrh4SCIvyIOQy1Vej0cjxBE_EH1ylS3TSlqug0e3cnQ0hXy8BFyJ2dlHuq4NqduY/s1600/step5.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="290" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjvrqyeFfOX18f38uyl4iwrFVaCdNJ3w2J9vMMziutgihGofQ9-3dgk35mRyCmIjETEtXPR3JYHmWAwrh4SCIvyIOQy1Vej0cjxBE_EH1ylS3TSlqug0e3cnQ0hXy8BFyJ2dlHuq4NqduY/s400/step5.jpg" width="400" /></a></div>
<br />
<strong><u>Step 7</u></strong><br />
<strong><u></u></strong><br />
Installation in progress.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg7IMwBWZqdWjK-e5SGfyaQw5UBXWxVOUs2auVopCo3KWOErXcy559hWUu1z3G8XWMeO-opTn0a6S7IJgDfaJxCd-T9gQNEUXLOYFGc0U8xw033WwnLrWABt5gRAJ8gybq-uSDqcgi017A/s1600/step6.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="290" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg7IMwBWZqdWjK-e5SGfyaQw5UBXWxVOUs2auVopCo3KWOErXcy559hWUu1z3G8XWMeO-opTn0a6S7IJgDfaJxCd-T9gQNEUXLOYFGc0U8xw033WwnLrWABt5gRAJ8gybq-uSDqcgi017A/s400/step6.jpg" width="400" /></a></div>
<br />
<strong><u>Step 7</u></strong><br />
<strong><u></u></strong><br />
Installation completed successfully.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhpkxVI1IVJRCRS-0wrQNxWDd6T072WACgQYpcUKXz_0FmARRgVUKCyLA8vcywPRe2dpIizIJy9Cg-Y6qgsRGSlLIlY0fMtkK1wZi8MU204ke2dOTKs2AkY8UkO246ckf_sezjpm18XS1A/s1600/STEP7.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="290" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhpkxVI1IVJRCRS-0wrQNxWDd6T072WACgQYpcUKXz_0FmARRgVUKCyLA8vcywPRe2dpIizIJy9Cg-Y6qgsRGSlLIlY0fMtkK1wZi8MU204ke2dOTKs2AkY8UkO246ckf_sezjpm18XS1A/s400/STEP7.jpg" width="400" /></a></div>
<br />
<br />
<strong><u>Installation Summary</u></strong><br />
<strong><u></u></strong><br />
<u>Selections/Settings:</u><br />
- Data Integration Server<br />
- Data Integration Server port: 9080<br />
- Monet Database Port: 50006 <br />
- Embedded Database Port: 9092Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com18tag:blogger.com,1999:blog-7135267265798115356.post-22616482445912319122014-03-12T22:31:00.001-07:002015-08-28T22:15:23.580-07:00Pentaho Data Integration : Google AnalyticsGoogle Analytics service provide details about a website's traffic. This service track various statistics and can be integrated with AdWords to review online campaigns.<br />
<br />
Pentaho Google Analytics step allows to extract Google Analytics data.<br />
Stepwise illustration given below.<br />
<br />
<strong><u>Step 1</u></strong><br />
<strong><u></u></strong><br />
Enable Google Analytics and generate API key.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj44tm0IZCt53k8k7C9R1VU5D0YP5-FuzvEzr5ur_V8V8eIBmG4u4waYxYmSdIv_Su7nBICjWGvI0SbaKPSzPXozqK4gbzFLvmsiC_5_Mc3YwuKJbQt2gl2T3Gl5uUyB282XcWvNWWfGBo/s1600/tag.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="1" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj44tm0IZCt53k8k7C9R1VU5D0YP5-FuzvEzr5ur_V8V8eIBmG4u4waYxYmSdIv_Su7nBICjWGvI0SbaKPSzPXozqK4gbzFLvmsiC_5_Mc3YwuKJbQt2gl2T3Gl5uUyB282XcWvNWWfGBo/s1600/tag.jpg" width="1" /></a></div>
<a name='more'></a><br />
<br />
<br />
<ol>
<li>Log into <a href="http://developers.google.com/">http://developers.google.com</a> using gmail id.</li>
<li>Click on API Console.</li>
<li>Enable Analytics API.</li>
<li>Copy API key from API access tab.</li>
</ol>
<br />
<strong><u>Step 2</u></strong><br />
<strong><u></u></strong><br />
Create a sample transformation. <br />
Drag "Google Analytics" step into the transformation canvas.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiCQY-8F0TDvOIUOZaLPNW-XaOA6gzah781sv6NCFkkisSylei3nwajGhIkpDMHhU-YI6l4QjIioWzsja90pdFgew_lkTAQtGFce6OVSNXsppj3J8iuvE2WqAMa_-0CMkvOxhXmwuE5HSw/s1600/copy1.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="163" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiCQY-8F0TDvOIUOZaLPNW-XaOA6gzah781sv6NCFkkisSylei3nwajGhIkpDMHhU-YI6l4QjIioWzsja90pdFgew_lkTAQtGFce6OVSNXsppj3J8iuvE2WqAMa_-0CMkvOxhXmwuE5HSw/s1600/copy1.jpg" width="400" /></a></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<strong><u>Step 3</u></strong><br />
<strong><u></u></strong><br />
Configure Google Analytics step.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgen_4ycrGyRpzPiEVMVbNPGZM36MMk4bVim3620guwz2dpPV65aPdu1-6XSRO12sY2rClaQRDi8ZSHNWbvAAsOt_vkPMS-mmi1XILHbK97AY5w05UiRdZIinp6rLgAZbon8yiIAqbgZ80/s1600/copy2.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="640" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgen_4ycrGyRpzPiEVMVbNPGZM36MMk4bVim3620guwz2dpPV65aPdu1-6XSRO12sY2rClaQRDi8ZSHNWbvAAsOt_vkPMS-mmi1XILHbK97AY5w05UiRdZIinp6rLgAZbon8yiIAqbgZ80/s1600/copy2.jpg" width="571" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<em><u><strong>Connection Settings</strong></u></em><br />
<u></u><br />
<ul>
<li><span style="color: blue;"><em>Application Name</em> : </span><span style="color: black;">Leave as default value <em>"type-exit.org kettle plugin"</em></span></li>
<li><span style="color: blue;"><em>Email </em>: </span><span style="color: black;">Google account user id</span></li>
<li><span style="color: blue;"><em>Password</em> : </span><span style="color: black;">Google account password</span></li>
<li><span style="color: blue;"><em>API Key</em> : </span><span style="color: black;">API key from step 1.</span></li>
</ul>
<br />
Click on "Get Profiles" button to populate table id from profile.<br />
<br />
<strong><em><u>Query Definition</u></em></strong><br />
<br />
<ul>
<li><span style="color: blue;"><em>Start Date</em> ( YYYY-MM-DD ) : </span><span style="color: black;">Data extract start date.</span></li>
<li><span style="color: blue;"><em>End Date</em> ( YYYY-MM-DD ) : </span><span style="color: black;">Data extract End date.</span></li>
<li><span style="color: blue;"><em>Filters</em> : </span><span style="color: black;">Selection criteria filter for the data extract.</span></li>
<li><span style="color: blue;"><em>Sort</em> : </span><span style="color: black;">Sort by field.</span></li>
</ul>
<span style="color: blue;"></span><br />
<ul>
<li><span style="color: blue;"><em>Dimensions</em> : </span><span style="color: black;">Dimensions to query on.</span></li>
</ul>
<span style="color: black;">Here are some of the commonly used dimensions.</span><br />
<br />
<ol>
<li>ga:continent</li>
<li>ga:subContinent</li>
<li>ga:country</li>
<li>ga:region</li>
<li>ga:metro</li>
<li>ga:city</li>
<li>ga:latitude</li>
<li>ga:longitude</li>
<li>ga:networkDomain</li>
<li>ga:visitorType</li>
<li>ga:visitCount</li>
<li>ga:daysSinceLastVisit</li>
</ol>
<br />
<ul>
<li><span style="color: blue;"><em>Metrics</em> : </span><span style="color: black;">Measures to retrieve.</span></li>
</ul>
<span style="color: blue;"> </span><span style="color: black;">Here are some of the commonly used metrics.</span><br />
<br />
<ol>
<li>ga:visitors</li>
<li>ga:newVisits</li>
<li>ga:percentNewVisits</li>
<li>ga:pageviews</li>
<li>ga:pageviewsPerVisit</li>
<li>ga:uniquePageviews</li>
<li>ga:timeOnPage</li>
<li>ga:avgTimeOnPage</li>
<li>ga:exits</li>
</ol>
<strong><u></u></strong><br />
<strong><u>Step 4</u></strong><br />
<strong><u></u></strong><br />
Preview data.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhhM89oAvuUWGKq17lNQHeNYDU2ppd3spnMUGy4U31kE3XXl4_5x4xC5RqC3-HC-YjryBm6hyphenhyphenBIIYHN0MxeqKA3_QsDOSF_CDG2iHcSnBRiQgpesYneTzC7sxEFGV1YoeDGOjLKIn_L0ws/s1600/copy5.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="410" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhhM89oAvuUWGKq17lNQHeNYDU2ppd3spnMUGy4U31kE3XXl4_5x4xC5RqC3-HC-YjryBm6hyphenhyphenBIIYHN0MxeqKA3_QsDOSF_CDG2iHcSnBRiQgpesYneTzC7sxEFGV1YoeDGOjLKIn_L0ws/s1600/copy5.jpg" width="640" /></a></div>
<br />
<br />
<strong><u>Step 5</u></strong><br />
<strong><u></u></strong><br />
Specify output file or table.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiSUL8lBV4hgPWbOtyDl9grd-nFB7Mh_g5Ycmkp8YLJ1DFz7cDjbZzqqY6S2IkdILr3_iM1Y20Zh57rD5ussvERvqEeq4q5HVSPt4avd1BKUP0XnQNykdZOydlKClYwbEEcuf7d7luHKQ8/s1600/copy3.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="166" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiSUL8lBV4hgPWbOtyDl9grd-nFB7Mh_g5Ycmkp8YLJ1DFz7cDjbZzqqY6S2IkdILr3_iM1Y20Zh57rD5ussvERvqEeq4q5HVSPt4avd1BKUP0XnQNykdZOydlKClYwbEEcuf7d7luHKQ8/s1600/copy3.jpg" width="400" /></a></div>
<br />
<br />
<strong><u><br /></u></strong>
<strong><u><br /></u></strong>
<strong><u><br /></u></strong>
<strong><u><br /></u></strong>
<strong><u><br /></u></strong>
<strong><u><br /></u></strong>
<strong><u><br /></u></strong>
<strong><u><br /></u></strong>
<strong><u><br /></u></strong>
<strong><u>Step 6</u></strong><br />
<strong><u></u></strong><br />
Execution results given below.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj5cl_FNik16l6TiihtzZUzf7YnnXg5PpsqC-wLTfzGP87dDImewAqZ9O-2UrbcPGT0PONxUF9GabH_W6tFHty1gfxNZl5E4EtDmQbRmoq7XKUoWppHFaW563-62LA1SqM4XZB29BPReR4/s1600/copy4.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="126" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj5cl_FNik16l6TiihtzZUzf7YnnXg5PpsqC-wLTfzGP87dDImewAqZ9O-2UrbcPGT0PONxUF9GabH_W6tFHty1gfxNZl5E4EtDmQbRmoq7XKUoWppHFaW563-62LA1SqM4XZB29BPReR4/s1600/copy4.jpg" width="400" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<em><span style="color: blue; font-size: xx-small;">2014/03/12 12:58:05 - Spoon - Asking for repository</span></em><br />
<div>
<em><span style="color: blue; font-size: xx-small;">2014/03/12 12:58:05 - Version checker - OK</span></em></div>
<div>
<em><span style="color: blue; font-size: xx-small;">2014/03/12 14:14:41 - Spoon - Transformation opened.</span></em></div>
<div>
<em><span style="color: blue; font-size: xx-small;">2014/03/12 14:14:41 - Spoon - Launching transformation [tr_google_analytics_sample]...</span></em></div>
<div>
<em><span style="color: blue; font-size: xx-small;">2014/03/12 14:14:41 - Spoon - Started the transformation execution.</span></em></div>
<div>
<em><span style="color: blue; font-size: xx-small;">2014/03/12 14:14:41 - tr_google_analytics_sample - Dispatching started for transformation [tr_google_analytics_sample]</span></em></div>
<div>
<em><span style="color: blue; font-size: xx-small;">2014/03/12 14:14:43 - Google Analytics.0 - Finished processing (I=0, O=0, R=0, W=32, U=0, E=0)</span></em></div>
<div>
<em><span style="color: blue; font-size: xx-small;">2014/03/12 14:14:43 - write_analytics_out.0 - Finished processing (I=0, O=33, R=32, W=32, U=0, E=0)</span></em></div>
<div>
<em><span style="color: blue; font-size: xx-small;">2014/03/12 14:14:43 - Spoon - The transformation has finished!!</span></em></div>
Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com0tag:blogger.com,1999:blog-7135267265798115356.post-74016588532968385662014-02-19T12:53:00.001-08:002015-08-28T22:16:00.307-07:00Pentaho Common Errors : Driver class 'org.gjt.mm.mysql.Driver' could not be found<span style="background-color: orange; margin: 0px; padding: 0px;"><strong><u><span style="color: #333333; font-family: Arial; font-size: x-small;">Error Message</span></u></strong></span><br />
<span style="font-size: x-small;">Error connecting to database [MySQLDev] : org.pentaho.di.core.exception.KettleDatabaseException: <br />Error occured while trying to connect to the database</span><br />
<span style="font-size: x-small;">Driver class 'org.gjt.mm.mysql.Driver' could not be found, make sure the 'MySQL' driver (jar file) is installed.<br />org.gjt.mm.mysql.Driver</span><br />
<a name='more'></a><span style="font-size: x-small;">org.pentaho.di.core.exception.KettleDatabaseException: <br />Error occured while trying to connect to the database</span><br />
<span style="font-size: x-small;">Driver class 'org.gjt.mm.mysql.Driver' could not be found, make sure the 'MySQL' driver (jar file) is installed.<br />org.gjt.mm.mysql.Driver</span><br />
<span style="font-size: x-small;"> at org.pentaho.di.core.database.Database.normalConnect(Database.java:415)<br /> at org.pentaho.di.core.database.Database.connect(Database.java:353)<br /> at org.pentaho.di.core.database.Database.connect(Database.java:306)</span><br />
<div class="separator" style="clear: both; text-align: center;">
</div>
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhI7_c8pS0HJnvEYyvblht4k-olXiR_LXwRNjrGXgGvIPasc9RPPXmAQgky-bHEa-Zc7TzrycfCwphcSJ3T0JypB-Krd8h7LnyqKswAqpGxCwSRN2lwhS22l7IdCOjg8VdlSrdNEgGAIEo/s1600/bug.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" height="1" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhI7_c8pS0HJnvEYyvblht4k-olXiR_LXwRNjrGXgGvIPasc9RPPXmAQgky-bHEa-Zc7TzrycfCwphcSJ3T0JypB-Krd8h7LnyqKswAqpGxCwSRN2lwhS22l7IdCOjg8VdlSrdNEgGAIEo/s1600/bug.jpg" width="1" /></a></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<span style="background-color: lime; margin: 0px; padding: 0px;"><strong><u><span style="color: #333333; font-family: Arial; font-size: x-small;">Resolution</span></u></strong></span></div>
<span style="font-size: x-small;"><em>Pentaho version : PDI 5.0.2</em></span><br />
<span style="font-size: x-small;"><em>OS : Linux CentOS release 6.2</em></span><br />
<br />
<strong><em>1) Download latest MySQL jdbc connector.</em></strong><br />
<br />
<a href="http://dev.mysql.com/downloads/connector/j/">http://dev.mysql.com/downloads/connector/j/</a><br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh_ZvgKr0UaYaX8I7u7iyhkaskiA1xjvpBwZZh_9kCri-iyxBnaINHItnkLagWONAAtlGdxG8zbZtX4XwOf7OISl8P7Jy3SKgSIi8E8g9HXj1i9mY7xA80gQv65obP7co6VQcM14wROgX8/s1600/copy1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="136" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh_ZvgKr0UaYaX8I7u7iyhkaskiA1xjvpBwZZh_9kCri-iyxBnaINHItnkLagWONAAtlGdxG8zbZtX4XwOf7OISl8P7Jy3SKgSIi8E8g9HXj1i9mY7xA80gQv65obP7co6VQcM14wROgX8/s1600/copy1.jpg" width="400" /></a></div>
<span style="font-size: x-small;"></span><br />
<span style="background-color: lime; margin: 0px; padding: 0px;"><strong><u><span style="color: #333333; font-family: Arial; font-size: x-small;"></span></u></strong></span>Download platform independent TAR archive for Linux OS.<br />
ZIP Archive can be used for Windows.<br />
<br />
<strong><em>2) Copy jar file to folder.</em></strong><br />
<br />
<ul>
<li>Locate tar file.</li>
</ul>
<span style="color: blue; font-size: x-small;"> [root@localhost Downloads]# ls<br /> mysql-connector-java-5.1.29.tar.gz<br /> [root@localhost Downloads]#</span> <br />
<br />
<ul>
<li>Unzip the file using following command.</li>
</ul>
<span style="color: blue; font-size: x-small;"> [root@localhost Downloads]# tar -zxvf mysql-connector-java-5.1.29.tar.gz</span><br />
<span style="font-size: x-small;"></span><br />
<ul>
<li>Copy file to folder pentaho/design-tools/data-integration/lib/</li>
</ul>
<span style="color: blue; font-size: x-small;"> cp mysql-connector-java-5.1.29-bin.jar /opt/pentaho/design-tools/data-integration/lib/</span><br />
<br />
<strong><em> 3) Restart Spoon.</em></strong><br />
<br />
<br />
<br />
<br />
<br />
<span style="background-color: lime; margin: 0px; padding: 0px;"><strong><u><span style="color: #333333; font-family: Arial; font-size: x-small;"></span></u></strong></span><br />
<span style="background-color: lime; margin: 0px; padding: 0px;"><strong><u><span style="color: #333333; font-family: Arial; font-size: x-small;"></span></u></strong></span><br />Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com9tag:blogger.com,1999:blog-7135267265798115356.post-42526342712281492392014-02-18T20:46:00.000-08:002014-02-20T12:30:43.536-08:00Pentaho Big Data : Pig Script ExecutorApache Pig is a high level data analysis language capable of handling very high data volume. Ease of programming, parallelization, extensibility and optimization opportunities are some of the key features of this platform.<br />
<br />
Pig Script Executor job entry step can be used to execute "Pig Latin" script on a Hadoop cluster.<br />
Stepwise illustration on how to configure PDI "Pig Script Executor" is given below.<br />
<a name='more'></a><br />
<em><span style="font-size: x-small;">Hortonworks Sandbox version 2.0 used for demo purpose. Refer link for more info.</span></em><br />
<a href="http://hortonworks.com/products/hortonworks-sandbox/#overview"><em><span style="font-size: x-small;">http://hortonworks.com/products/hortonworks-sandbox/#overview</span></em></a><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgI4s81JJMahInprTbIxUIcFYKXXFIFdDKcCmjTdtt7SniIh1kmS2HwjvxzLZzUbwlnbKotXR_8KV7jGw4wsuevJq75LAK2v5VydCppx6HYqXyfgfPtTXw9FyBdzUsftRfotAqVM9c3r6Q/s1600/hadoop1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgI4s81JJMahInprTbIxUIcFYKXXFIFdDKcCmjTdtt7SniIh1kmS2HwjvxzLZzUbwlnbKotXR_8KV7jGw4wsuevJq75LAK2v5VydCppx6HYqXyfgfPtTXw9FyBdzUsftRfotAqVM9c3r6Q/s1600/hadoop1.jpg" height="1" width="1" /></a></div>
<br />
<strong><u>Step 1</u></strong><br />
<strong><u></u></strong><br />
Open Spoon and create a new job.<br />
Drag the "Pig Script Executor" step into the canvas.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgzrRoXD0HhPfPfvdBsosOeOU4wryT3Ewl7EGOAZmNkU0ri-4kEJWuK4KnULMpdiOeztNGQGnZQ-s7hULxhaezhM7iGLabF9xfCEpCdW5GDY8b6W8RNqYvntHiJuAY1u0KMNsSOazslvaQ/s1600/copy1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgzrRoXD0HhPfPfvdBsosOeOU4wryT3Ewl7EGOAZmNkU0ri-4kEJWuK4KnULMpdiOeztNGQGnZQ-s7hULxhaezhM7iGLabF9xfCEpCdW5GDY8b6W8RNqYvntHiJuAY1u0KMNsSOazslvaQ/s1600/copy1.jpg" height="268" width="640" /></a></div>
<br />
<strong><u>Step 2</u></strong><br />
<strong><u></u></strong><br />
Enter HDFS host and port information.<br />
Similarly provide Job tracker host name and port.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgk9FoZsyEMqwFzAg8dGyOUUt2yWDApcrompHddIaXnPKOB5WJoHiUc8Zv97lqyNncUi8p59GM3KAg840vLjz1jAR5PajLWiu8RVejE2KdM08VDPLInIutaO_duQzc2z_w9zqhS2VeURc0/s1600/copy2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgk9FoZsyEMqwFzAg8dGyOUUt2yWDApcrompHddIaXnPKOB5WJoHiUc8Zv97lqyNncUi8p59GM3KAg840vLjz1jAR5PajLWiu8RVejE2KdM08VDPLInIutaO_duQzc2z_w9zqhS2VeURc0/s1600/copy2.jpg" height="520" width="640" /></a></div>
<br />
<strong><u>Step 3</u></strong><br />
<strong><u></u></strong><br />
Enter name and location of the pig script or select the script using "Browse" option.<br />
Check "Enable Blocking" in case down stream entries need to wait for successful execution of the Pig script. Local execution option can be enabled for local testing purposes.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjCT1tqQ-zpS56UF50xmGyIf2eRsHgVrR8RyX-9tNI_XMHF6wqXMm9X4jf-x5nplHsF4cJxOoi5UiAO8ManfHpOii5cPrp4J7Vux4mSIfppGyXwT6nR0VQ7RbYrdQH_9qe9Ybb1Oz3K8Qg/s1600/copy3.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjCT1tqQ-zpS56UF50xmGyIf2eRsHgVrR8RyX-9tNI_XMHF6wqXMm9X4jf-x5nplHsF4cJxOoi5UiAO8ManfHpOii5cPrp4J7Vux4mSIfppGyXwT6nR0VQ7RbYrdQH_9qe9Ybb1Oz3K8Qg/s1600/copy3.jpg" height="372" width="640" /></a></div>
<br />
<u>Pig Script ( sum_trans_pig_script.pig )</u><br />
<u></u><br />
<span style="color: blue; font-size: x-small;">trans = load <span style="color: orange;"><strong>'/user/hue/transactions/store_transactions.csv'</strong></span> using PigStorage(',');<br />trans_line = FOREACH trans GENERATE TRIM($2) as storeName, $4 as transAmt;<br />grp_trans = GROUP trans_line by (storeName);<br />sum_amt = FOREACH grp_trans GENERATE group as grp,(ROUND(100f*SUM(trans_line.transAmt)))/100f as sum_amt;<br />STORE sum_amt INTO <span style="color: orange;"><strong>'/user/hue/transactions/trans_out'</strong></span> USING PigStorage(',');</span><br />
<span style="color: blue; font-size: x-small;"></span><br />
<u>Input CSV Data ( Sample )</u><br />
<br />
<span style="font-size: xx-small;"><span style="font-size: x-small;">[root@sandbox ~]# hadoop fs -cat /user/hue/transactions/store_transactions.csv|head -10</span><br />10001,2012-01-01 09:00:00.0,Fort Worth,Women's Clothing,153.57,Visa<br />10002,2012-01-01 09:00:00.0,San Diego,Music,66.08,Cash<br />10003,2012-01-01 09:00:00.0,Pittsburgh,Pet Supplies,493.51,Discover<br />10004,2012-01-01 09:00:00.0,Omaha,Children's Clothing,235.63,MasterCard<br />10005,2012-01-01 09:00:00.0,Stockton,Men's Clothing,247.18,MasterCard<br />10006,2012-01-01 09:00:00.0,Austin,Cameras,379.6,Visa<br />10007,2012-01-01 09:00:00.0,New York,Consumer Electronics,296.8,Cash<br />10008,2012-01-01 09:00:00.0,Corpus Christi,Toys,25.38,Discover<br />10009,2012-01-01 09:00:00.0,Fort Worth,Toys,213.88,Visa<br />10010,2012-01-01 09:00:00.0,Las Vegas,Video Games,53.26,Visa</span><br />
<span style="font-size: xx-small;"></span><br />
<br />
<strong><u>Step 4</u></strong><br />
<strong><u></u></strong><br />
<u>Execution Results</u><br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhdNeW0DFKsdciMGvmWO-kLGu_CScKQNdCEUxKQ_6xmywKD_lhlboTlBN2EgsBUAxrLf6REwvCLN64WBpUq9WnHeuPkJ4h0-8ktFeI2qcjO3aj1bXeUYisB1cDiy2kWj5A3jdOdeowyVjg/s1600/copy4.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhdNeW0DFKsdciMGvmWO-kLGu_CScKQNdCEUxKQ_6xmywKD_lhlboTlBN2EgsBUAxrLf6REwvCLN64WBpUq9WnHeuPkJ4h0-8ktFeI2qcjO3aj1bXeUYisB1cDiy2kWj5A3jdOdeowyVjg/s1600/copy4.jpg" height="354" width="640" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<u>Output CSV Data ( Sample )</u></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<span style="font-size: x-small;">[root@sandbox ~]# hadoop fs -cat /user/hue/transactions/trans_out/part-r-00000</span></div>
<span style="font-size: xx-small;">Mesa,27046.94</span><br />
<span style="font-size: xx-small;">Reno,23507.56</span><br />
<span style="font-size: xx-small;">Boise,24273.33</span><br />
<span style="font-size: xx-small;">Miami,25034.11</span><br />
<span style="font-size: xx-small;">Omaha,28500.32</span><br />
<span style="font-size: xx-small;">Plano,26486.6</span><br />
<span style="font-size: xx-small;">Tampa,22132.69</span><br />
<span style="font-size: xx-small;">Tulsa,18669.55</span><br />
<span style="font-size: xx-small;">Aurora,28181.13</span><br />
<span style="font-size: xx-small;">Austin,25597.62</span><br />
<span style="font-size: xx-small;"></span><br />
<u>Execution Log</u><br />
<u></u><br />
<span style="font-size: xx-small;">2014/02/18 16:13:11 - Spoon - Asking for repository<br />2014/02/18 16:13:11 - Version checker - OK<br />2014/02/18 16:13:17 - Spoon - Starting job...<br />2014/02/18 16:13:17 - job_demo_pig - Start of job execution<br />2014/02/18 16:13:17 - job_demo_pig - Starting entry [Pig Script Executor]<br />2014/02/18 16:13:17 - Pig Script Executor - 2014/02/18 16:13:17 - Connecting to hadoop file system at: hdfs://192.168.154.131:8020<br />2014/02/18 16:13:18 - Pig Script Executor - 2014/02/18 16:13:18 - Connecting to map-reduce job tracker at: 192.168.154.131:8021<br />2014/02/18 16:13:19 - Pig Script Executor - 2014/02/18 16:13:19 - Setting Parallelism to 1<br />2014/02/18 16:13:19 - Pig Script Executor - 2014/02/18 16:13:19 - creating jar file Job7705643851097701358.jar<br />2014/02/18 16:13:21 - Pig Script Executor - 2014/02/18 16:13:21 - jar file Job7705643851097701358.jar created<br />2014/02/18 16:13:21 - Pig Script Executor - 2014/02/18 16:13:21 - 1 map-reduce job(s) waiting for submission.<br />2014/02/18 16:13:21 - Pig Script Executor - 2014/02/18 16:13:21 - Total input paths to process : 1<br />2014/02/18 16:13:21 - Pig Script Executor - 2014/02/18 16:13:21 - Total input paths (combined) to process : 1<br />2014/02/18 16:13:22 - Pig Script Executor - 2014/02/18 16:13:22 - HadoopJobId: job_1392722352996_0029<br />2014/02/18 16:13:22 - Pig Script Executor - 2014/02/18 16:13:22 - Processing aliases grp_trans,sum_amt,trans,trans_line<br />2014/02/18 16:13:22 - Pig Script Executor - 2014/02/18 16:13:22 - detailed locations: M: trans[1,8],trans_line[2,13],sum_amt[4,10],grp_trans[3,12] C: sum_amt[4,10],grp_trans[3,12] R: sum_amt[4,10]</span><span style="font-size: xx-small;">2014/02/18 16:13:22 - Pig Script Executor - 2014/02/18 16:13:22 - 0% complete<br />2014/02/18 16:13:33 - Pig Script Executor - 2014/02/18 16:13:33 - 50% complete<br />2014/02/18 16:13:42 - Pig Script Executor - 2014/02/18 16:13:42 - 100% complete<br />Input(s):<br />Successfully read 10001 records (621892 bytes) from: "/user/hue/transactions/store_transactions.csv"</span><br />
<span style="font-size: xx-small;">Output(s):<br />Successfully stored 103 records (1912 bytes) in: "/user/hue/transactions/trans_out"</span><br />
<span style="font-size: xx-small;">Counters:<br />Total records written : 103<br />Total bytes written : 1912<br />Spillable Memory Manager spill count : 0<br />Total bags proactively spilled: 0<br />Total records proactively spilled: 0</span><br />
<span style="font-size: xx-small;">Job DAG:<br />job_1392722352996_0029<br />2014/02/18 16:13:42 - Pig Script Executor - 2014/02/18 16:13:42 - Success!<br />2014/02/18 16:13:42 - Pig Script Executor - Num successful jobs: 1 num failed jobs: 0<br />2014/02/18 16:13:42 - job_demo_pig - Finished job entry [Pig Script Executor] (result=[true])<br />2014/02/18 16:13:42 - job_demo_pig - Job execution finished<br />2014/02/18 16:13:42 - Spoon - Job has ended.</span>Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com1tag:blogger.com,1999:blog-7135267265798115356.post-27502263989172526982014-01-30T16:00:00.000-08:002014-02-20T13:37:39.582-08:00Pentaho Big Data : Hadoop File InputThe Hadoop File Input step can be used to extract data from Hadoop cluster. This step can read comma separated, tab delimited , fixed width and other common types of text files.<br />
<br />
Stepwise illustration on how to configure Pentaho Hadoop file input is given below.<br />
<br />
<em>Cloudera Quick Start VM used for demo purpose. Refer link for more info.</em><br />
<a href="http://www.cloudera.com/content/cloudera-content/cloudera-docs/DemoVMs/Cloudera-QuickStart-VM/cloudera_quickstart_vm.html"><em><span style="font-size: xx-small;">http://www.cloudera.com/content/cloudera-content/cloudera-docs/DemoVMs/Cloudera-QuickStart-VM/cloudera_quickstart_vm.html</span></em></a><br />
<a name='more'></a><div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiiowqmh69x8bWDjPHHBD1GKXyTSpQ1_dVqVehT5rduQQRGhWylSlbrhdsPkn0nZ3J6FcbkhxGNns5GW06N2mOYm53IVkZSHTw_Q4W1riP8xLAbDbk1SAL2S3BBFk_bI-pV7EuKr4Y-fDk/s1600/hadoop1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiiowqmh69x8bWDjPHHBD1GKXyTSpQ1_dVqVehT5rduQQRGhWylSlbrhdsPkn0nZ3J6FcbkhxGNns5GW06N2mOYm53IVkZSHTw_Q4W1riP8xLAbDbk1SAL2S3BBFk_bI-pV7EuKr4Y-fDk/s1600/hadoop1.jpg" height="1" width="1" /></a></div>
<br />
<strong><u>Step 1</u></strong><br />
<strong><u></u></strong><br />
Open Spoon and create a new transformation.<br />
Expand Big data section on design palette and drag Hadoop File Input onto the Canvas.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjXdrV4Zk79kDB-0j2LuOJV-0_Rjduj9ETGVkkkGtJfacorpQAkQsiwcL7PPf9ckHulUOG-AeVtIaRlXRXFwV7gXfxBfM3wuUU__nwLw_Yf0H6MBnPkKG5TtDU2cUXwpUn6NEBqEIZ3PaM/s1600/step1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjXdrV4Zk79kDB-0j2LuOJV-0_Rjduj9ETGVkkkGtJfacorpQAkQsiwcL7PPf9ckHulUOG-AeVtIaRlXRXFwV7gXfxBfM3wuUU__nwLw_Yf0H6MBnPkKG5TtDU2cUXwpUn6NEBqEIZ3PaM/s1600/step1.jpg" height="212" width="640" /></a></div>
<br />
<strong><u>Step 2</u></strong><br />
<strong><u></u></strong><br />
Configure the Input file name and location.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh8IVCFS470gE9gerGjaZMevx9vhNXpJ5gbQEzfi9HpPXsBWmnd3jHmlTEuwFLm_2UUBlCLI1QWhU_koRe53k47FpaEITDa_Ilj2O2XXzcYKtIWxwucmieovkFv5a3qb7k1yQBA_GlqYv4/s1600/step2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh8IVCFS470gE9gerGjaZMevx9vhNXpJ5gbQEzfi9HpPXsBWmnd3jHmlTEuwFLm_2UUBlCLI1QWhU_koRe53k47FpaEITDa_Ilj2O2XXzcYKtIWxwucmieovkFv5a3qb7k1yQBA_GlqYv4/s1600/step2.jpg" height="214" width="640" /></a></div>
<br />
File "hdfs://${USER}:${PASSWORD}@localhost:8020/user/training/input/purchases.txt" used for demo purpose. Ensure that this file is available on HDFS.<br />
<br />
<span style="color: blue;"><em>[training@localhost conf]$ <strong>hadoop fs -ls /user/training/input</strong><br />Found 1 items<br />-rw-r--r-- 1 training supergroup 211312924 2014-01-26 13:33 /user/training/input/purchases.txt<br />[training@localhost conf]$</em> </span><br />
<br />
<em><span style="background-color: white; color: #990000;">[training@localhost ~]$ <strong>hadoop fs -cat /user/training/input/purchases.txt | head -10</strong><br />2012-01-01 09:00 San Jose Men's Clothing 214.05 Amex<br />2012-01-01 09:00 Fort Worth Women's Clothing 153.57 Visa<br />2012-01-01 09:00 San Diego Music 66.08 Cash<br />2012-01-01 09:00 Pittsburgh Pet Supplies 493.51 Discover<br />2012-01-01 09:00 Omaha Children's Clothing 235.63 MasterCard<br />2012-01-01 09:00 Stockton Men's Clothing 247.18 MasterCard<br />2012-01-01 09:00 Austin Cameras 379.6 Visa</span></em><br />
<em><span style="color: #990000;"></span></em><br />
<strong><u>Step 3</u></strong><br />
<strong><u></u></strong><br />
Configure the Input file type and format.<br />
Tab separated file used for demo.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgZFlYFwNVcIvri-hijK0Zyb5oD3qQHSwvgta1BdU3-Pzvki3hZwOgCqnW8Fs6A9G-PXEpuBKfPnLFODQEBcrXRvPRZOeSf-fG659jHDWSV5FNNFJA6sSFPxEJ3b0WNhu_3zqJmOrK6Hu8/s1600/step3.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgZFlYFwNVcIvri-hijK0Zyb5oD3qQHSwvgta1BdU3-Pzvki3hZwOgCqnW8Fs6A9G-PXEpuBKfPnLFODQEBcrXRvPRZOeSf-fG659jHDWSV5FNNFJA6sSFPxEJ3b0WNhu_3zqJmOrK6Hu8/s1600/step3.jpg" height="288" width="640" /></a></div>
<br />
<strong><u>Step 4</u></strong><br />
<strong><u></u></strong><br />
Configure field names.<br />
Can use "Get Fields" button if needed.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiy-WKGiZrkB1AeA7-VFklxWLY6bnfTeZ5_JFft7d-4I6P19bvLJ_njUIu5IyPtjxrBU-phlXHYuvPSw16idWNJOLtL2fl-znqFs1YQ2FnG9f7uhwhBwmURagfd9O4ek91OAG91y1LZFO8/s1600/step4.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiy-WKGiZrkB1AeA7-VFklxWLY6bnfTeZ5_JFft7d-4I6P19bvLJ_njUIu5IyPtjxrBU-phlXHYuvPSw16idWNJOLtL2fl-znqFs1YQ2FnG9f7uhwhBwmURagfd9O4ek91OAG91y1LZFO8/s1600/step4.jpg" height="292" width="640" /></a></div>
<br />
<strong><u>Step 5</u></strong><br />
<strong><u></u></strong><br />
Use "Preview rows" option to examine source data.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh1wkKhRRc2hyuxzp3sLeLR1k8NeYXE4ETu7fkY85QHx29iHZF01HquaWZApmm1_EAVkyCZ705uoWG0Qe5gpu4AgJ5UTkxFqxjOCRQ49YJ9BQQCYuw7DwhxhJGupTpY8p8XmGmeb_KKoSA/s1600/step6.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh1wkKhRRc2hyuxzp3sLeLR1k8NeYXE4ETu7fkY85QHx29iHZF01HquaWZApmm1_EAVkyCZ705uoWG0Qe5gpu4AgJ5UTkxFqxjOCRQ49YJ9BQQCYuw7DwhxhJGupTpY8p8XmGmeb_KKoSA/s1600/step6.jpg" height="402" width="640" /></a></div>
<br />
<strong><u>Step 6</u></strong><br />
<strong><u></u></strong><br />
Pass user name and password as parameters.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEipPb6xl93tCHqKusK9ooPSModluW4kJytl14VSRjsv2JPFZ11frFyJ2lD8MZSG6G0AcTrEt5HFpMMNu03Ep4mPgFOt9DPfuWiEX4tIqYxHYMMNE7frNhZvvtmu8EEtRTow_e9yuNdrJgM/s1600/step7.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEipPb6xl93tCHqKusK9ooPSModluW4kJytl14VSRjsv2JPFZ11frFyJ2lD8MZSG6G0AcTrEt5HFpMMNu03Ep4mPgFOt9DPfuWiEX4tIqYxHYMMNE7frNhZvvtmu8EEtRTow_e9yuNdrJgM/s1600/step7.jpg" height="196" width="640" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<strong><u>Step 7</u></strong><br />
<strong><u></u></strong><br />
Execution log and Results.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi5EyLU17LqOlDPer7AStlD4ozQ_BQ2UY6Ag6-qfBltYDwCYz0WAMMO5tINTDvw9xyLwbr7Fh8XhY1cbG5O_nUy82FjB8RrqB6SPhp6uTjT8zisuZ3YLdx-ucWJaSQOEL7HfKyNUrNS7WQ/s1600/step8.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi5EyLU17LqOlDPer7AStlD4ozQ_BQ2UY6Ag6-qfBltYDwCYz0WAMMO5tINTDvw9xyLwbr7Fh8XhY1cbG5O_nUy82FjB8RrqB6SPhp6uTjT8zisuZ3YLdx-ucWJaSQOEL7HfKyNUrNS7WQ/s1600/step8.jpg" height="194" width="640" /></a></div>
<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj8CkkrkoVK8lhm9NeiEiKVngw3jSzoJY4FUie33zDNpH6XVl6RpfdU-SKPQzS0fu3xRxmLshDeb8ufCoZ9BKVmBt_pRgeE4-QW53SpJAzbvLv8Pu3SKPYAC70BdR-sl1pUwh0troQItRk/s1600/step9.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj8CkkrkoVK8lhm9NeiEiKVngw3jSzoJY4FUie33zDNpH6XVl6RpfdU-SKPQzS0fu3xRxmLshDeb8ufCoZ9BKVmBt_pRgeE4-QW53SpJAzbvLv8Pu3SKPYAC70BdR-sl1pUwh0troQItRk/s1600/step9.jpg" height="224" width="640" /></a></div>
<br />
<em><span style="font-size: x-small;">2014/01/29 15:22:53 - Spoon - Transformation opened.<br />2014/01/29 15:22:53 - Spoon - Launching transformation [tr_testar1]...<br />2014/01/29 15:22:53 - Spoon - Started the transformation execution.<br />2014/01/29 15:22:53 - tr_testar1 - Dispatching started for transformation [tr_testar1]<br />2014/01/29 15:22:54 - Hadoop File Input.0 - Opening file: hdfs://cloudera:***@localhost:8020/user/training/input/purchases.txt<br />2014/01/29 15:23:35 - Hadoop File Input.0 - Finished processing (I=4138476, O=0, R=0, W=4138476, U=1, E=0)<br />2014/01/29 15:23:35 - Dummy (do nothing).0 - Finished processing (I=0, O=0, R=4138476, W=4138476, U=0, E=0)<br />2014/01/29 15:23:35 - Spoon - The transformation has finished!!</span></em>Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com30tag:blogger.com,1999:blog-7135267265798115356.post-55261054002996659072014-01-24T14:03:00.001-08:002015-08-28T22:17:16.501-07:00Pentaho Data Integration : JSON input StepJSON (JavaScript Object Notation) is a text based, light weight data inter change format.<br />
This format enjoys a wide availability of implementations and is platform independent.<br />
<br />
Stepwise illustration on the usage of Pentaho JSON step given below.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgyLwIDDXhYYJf1D3uRmPwIuVA7NXuOEkP3q8ZmwCx7xQA_mSZ_OarZ5XnMsnLyaczdY6b60vb24axdUnI7tocpsgJ61UiFVv4NM8SAeblQsfWTSfCT2OguTnV5NtgDh1YQQJsTgvVscwU/s1600/json.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="1" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgyLwIDDXhYYJf1D3uRmPwIuVA7NXuOEkP3q8ZmwCx7xQA_mSZ_OarZ5XnMsnLyaczdY6b60vb24axdUnI7tocpsgJ61UiFVv4NM8SAeblQsfWTSfCT2OguTnV5NtgDh1YQQJsTgvVscwU/s1600/json.jpg" width="1" /></a></div>
<br />
<strong><u>Step 1</u></strong><br />
<strong><u></u></strong><br />
<a name='more'></a><br />
Create sample transformation with JSON input step.<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh2lviSt75AeLVMkDpBg0mtxh8dMuRBeJ8WNi4FSQLYfnXlIFw54lt8hoi894sJxsGZAkD3Hcwof3HPfhclRQhLEVTUxvuegER17NPBw8i-6mMIQWjbZlZJVBxJsyAQ4Rq3EbbkIEEjA4M/s1600/copy1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="176" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh2lviSt75AeLVMkDpBg0mtxh8dMuRBeJ8WNi4FSQLYfnXlIFw54lt8hoi894sJxsGZAkD3Hcwof3HPfhclRQhLEVTUxvuegER17NPBw8i-6mMIQWjbZlZJVBxJsyAQ4Rq3EbbkIEEjA4M/s1600/copy1.jpg" width="400" /></a><br />
<br />
<strong><u>Step 2</u></strong><br />
<strong><u></u></strong><br />
Browse and add sample test file in JSON format.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgLX5uxE42q6xoxMUR_oWTtsJ5kNNSBOLq0GnOXwwbUhocvKLFr-vxhw3hp0TfI-uBcpwks8LHqCHtD34lBaw0pScTN-24LRx_5COF1gs7O007E65wcYuv8IwsTswbcLVZbOThRNhSjClU/s1600/copy6.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="201" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgLX5uxE42q6xoxMUR_oWTtsJ5kNNSBOLq0GnOXwwbUhocvKLFr-vxhw3hp0TfI-uBcpwks8LHqCHtD34lBaw0pScTN-24LRx_5COF1gs7O007E65wcYuv8IwsTswbcLVZbOThRNhSjClU/s1600/copy6.jpg" width="400" /></a></div>
<br />
<u>Sample JSON Input data file :</u><br />
Real time stock quotes in JSON format used as sample test input for demo purpose.<br />
<span style="color: blue; font-family: Arial, Helvetica, sans-serif; font-size: xx-small;"><em>{"stockdata":[</em></span><br />
<span style="font-size: xx-small;"></span><br />
<div style="text-align: left;">
<span style="font-size: xx-small;"><span style="color: blue; font-family: Arial, Helvetica, sans-serif;"><em>{"DAY_LOW":547.81,"BID":551.08,"OPEN":550.88,"52WK_LOW":385.1,"LAST_TRADE":551.08,"DAY_HIGH":557.29,"STOCK_NAME":"APPLE INC",<br />"RUN_DATE":"2014-01-22 12:10:04.000000000","VOLUME":11654341,"STOCK_SYMBOL":"AAPL","52WK_HIGH":575.14,"ASK":551.17},</em></span></span></div>
<span style="font-size: xx-small;">
</span><br />
<div style="text-align: left;">
<span style="font-size: xx-small;"><span style="color: blue; font-family: Arial, Helvetica, sans-serif;"><em>{"DAY_LOW":1158.86,"BID":1162.3,"OPEN":1166.6,"52WK_LOW":735.79,"LAST_TRADE":1162.54,"DAY_HIGH":1167.89,"STOCK_NAME":"GOOGLE INC","RUN_DATE":"2014-01-22 12:10:04.000000000","VOLUME":1064205,"STOCK_SYMBOL":"GOOG","52WK_HIGH":1164.0,"ASK":1162.62},</em></span></span></div>
<span style="font-size: xx-small;">
</span><br />
<div style="text-align: left;">
<span style="font-size: xx-small;"><span style="color: blue; font-family: Arial, Helvetica, sans-serif;"><em>{"DAY_LOW":57.1,"BID":57.26,"OPEN":58.8,"52WK_LOW":22.67,"LAST_TRADE":57.26,"DAY_HIGH":59.31,"STOCK_NAME":"FACEBOOK",</em></span></span></div>
<span style="font-size: xx-small;">
</span><br />
<div style="text-align: left;">
<span style="font-size: xx-small;"><span style="color: blue; font-family: Arial, Helvetica, sans-serif;"><em>"RUN_DATE":"2014-01-22 12:10:05.000000000","VOLUME":52050684,"STOCK_SYMBOL":"FB","52WK_HIGH":58.96,"ASK":57.27},</em></span></span></div>
<span style="font-size: xx-small;">
</span><div style="text-align: left;">
<span style="font-size: xx-small;"><span style="color: blue; font-family: Arial, Helvetica, sans-serif;"><em>{"DAY_LOW":39.32,"BID":40.06,"OPEN":39.66,"52WK_LOW":19.31,"LAST_TRADE":40.06,"DAY_HIGH":40.13,"STOCK_NAME":"YAHOO",<br />"RUN_DATE":"2014-01-22 12:10:05.000000000","VOLUME":8438596,"STOCK_SYMBOL":"YHOO","52WK_HIGH":41.72,"ASK":40.07},</em></span></span></div>
<span style="font-size: xx-small;">
<div style="text-align: left;">
<span style="color: blue; font-family: Arial, Helvetica, sans-serif;"><em>{"DAY_LOW":35.75,"BID":36.07,"OPEN":36.26,"52WK_LOW":27.1,"LAST_TRADE":36.08,"DAY_HIGH":36.32,"STOCK_NAME":"MICROSOFT",<br />"RUN_DATE":"2014-01-22 12:10:05.000000000","VOLUME":14150161,"STOCK_SYMBOL":"MSFT","52WK_HIGH":38.98,"ASK":36.08}</em></span></div>
<div style="text-align: left;">
<span style="color: blue; font-family: Arial, Helvetica, sans-serif;"><em>]}</em></span></div>
</span><br />
<div class="separator" style="clear: both; text-align: left;">
<strong><u>Step 3</u></strong></div>
<br />
Configure the field names and paths.<br />
This can vary based requirements.<br />
<br />
For example, individual field values can be retrieved using following paths.<br />
<br />
STOCK_SYMBOL --> $..STOCK_SYMBOL<br />
STOCK_NAME --> $..STOCK_NAME<br />
<div style="text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgevBl0ADwVjzIO_OQ7yer30MkGJuM3dsc_HA6BkSeNKJXKPY6lD-iIaaBqLpIqdraT_jypMFkZm7LDbs-0N1kl1R48Ccn4rs9VFZ9QaRkPQRxywzNVIfvPzmqXZUE-2BK39RZtVljwnME/s1600/copy2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="554" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgevBl0ADwVjzIO_OQ7yer30MkGJuM3dsc_HA6BkSeNKJXKPY6lD-iIaaBqLpIqdraT_jypMFkZm7LDbs-0N1kl1R48Ccn4rs9VFZ9QaRkPQRxywzNVIfvPzmqXZUE-2BK39RZtVljwnME/s1600/copy2.jpg" width="640" /></a></div>
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgevBl0ADwVjzIO_OQ7yer30MkGJuM3dsc_HA6BkSeNKJXKPY6lD-iIaaBqLpIqdraT_jypMFkZm7LDbs-0N1kl1R48Ccn4rs9VFZ9QaRkPQRxywzNVIfvPzmqXZUE-2BK39RZtVljwnME/s1600/copy2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"></a><br />
All records can be retrieved by specifying "*" wild card in the path.<br />
eg : $.stockdata.*<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjs3cxSMTPTHA6PPud4YPf6bioMAfgXt4igPYeL8zk6HcEatI-Xqqk4Y9VRXDOdZaNfwFL6mEs9HXSyde85sA00WMtiVx2PhHkyn-2NMn6hvFUWgciINU7_VfD7HFVOgcyfYUB5VBIWxI4/s1600/copy4.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="262" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjs3cxSMTPTHA6PPud4YPf6bioMAfgXt4igPYeL8zk6HcEatI-Xqqk4Y9VRXDOdZaNfwFL6mEs9HXSyde85sA00WMtiVx2PhHkyn-2NMn6hvFUWgciINU7_VfD7HFVOgcyfYUB5VBIWxI4/s1600/copy4.jpg" width="640" /></a><br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<strong><u>Step 4</u></strong></div>
<br />
Preview source data.<br />
<br />
<strong><u>For Individual fields</u></strong><br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjtlABEWhUNZ7RXsappRm0zVk5J6I27fizJCZzhnvXtLHOM-B8gqC09ZIs9vYMt2kBTedmUBFKqOM3gwzXpM1opA2NGNS-ZjX6Ux5h3bYli6G8j7W1RSMGz54BOnnfUdL8cbW2NCWe2RMg/s1600/copy3.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="224" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjtlABEWhUNZ7RXsappRm0zVk5J6I27fizJCZzhnvXtLHOM-B8gqC09ZIs9vYMt2kBTedmUBFKqOM3gwzXpM1opA2NGNS-ZjX6Ux5h3bYli6G8j7W1RSMGz54BOnnfUdL8cbW2NCWe2RMg/s1600/copy3.jpg" width="640" /></a><br />
<br />
<strong><u>For all records</u></strong> <br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjMDQpc1H9-MsuoFd1FEvsSlbpd2lRAn8_MF_TYDLH00tQBoeSLvTiD0oClHx59iG9BfkL3BErfX3J3w1YanaruJcHA_-sTZa3U2tC614f35zCISum9TbbLFDP48Eq5k-V1AKhf3CTWP9U/s1600/copy5.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="100" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjMDQpc1H9-MsuoFd1FEvsSlbpd2lRAn8_MF_TYDLH00tQBoeSLvTiD0oClHx59iG9BfkL3BErfX3J3w1YanaruJcHA_-sTZa3U2tC614f35zCISum9TbbLFDP48Eq5k-V1AKhf3CTWP9U/s1600/copy5.jpg" width="640" /></a></div>
Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com10tag:blogger.com,1999:blog-7135267265798115356.post-53626180027626825632014-01-22T17:35:00.000-08:002014-02-20T13:38:57.709-08:00Pentaho Data Integration : HTTP ClientHTTP Client provide ability to perform a call to a base URL with parameter values and return result value as a string. Sample transformation given below.<br />
<br />
Free Yahoo finance API to download stock quotes used here for demo purpose.<br />
Current stock prices with a 15 minute delay can be retrieved using this API.<br />
Service return data in CSV format.<br />
<br />
Base URL : <a href="http://finance.yahoo.com/d/quotes.csv">http://finance.yahoo.com/d/quotes.csv</a><br />
<br />
<a name='more'></a><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiKFya4ENnGuw-Zl0f5Y9008sfjHOZ40bbXFXen5RMzdXCUTSGjzjV0Sh086Cn0bNwWHhei8gjpBluGvw1iguboRWRaZP7tYkT1_7zEAG4DoXNrBaOAE2kmgjqY1V7ZKNGFj_5jckNs23M/s1600/http.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiKFya4ENnGuw-Zl0f5Y9008sfjHOZ40bbXFXen5RMzdXCUTSGjzjV0Sh086Cn0bNwWHhei8gjpBluGvw1iguboRWRaZP7tYkT1_7zEAG4DoXNrBaOAE2kmgjqY1V7ZKNGFj_5jckNs23M/s1600/http.jpg" height="1" width="1" /></a></div>
<br />
<strong><u>Parameters</u></strong><br />
<strong><u></u></strong><br />
s --> Stock symbols ( AAPL, GOOG, FB e.t.c )<br />
f --> List of flags. Here are some sample values for f.<br />
<br />
o : open<br />
a : Ask<br />
b : Bid<br />
l1: Last Trade Price<br />
h : Day’s High<br />
g : Day’s Low<br />
v : Volume<br />
k : 52 Week High<br />
j : 52 week Low<br />
<br />
Sample output for AAPL : <a href="http://finance.yahoo.com/d/quotes.csv?s=AAPL&f=oabl1hgvkj">http://finance.yahoo.com/d/quotes.csv?s=AAPL&f=oabl1hgvkj</a><br />
<strong><u></u></strong><br />
<strong><u>Sample Transformation</u></strong> <br />
<br />
Stepwise illustration to download stock quotes using HTTP Client step given below.<br />
Output formatted and inserted into MySQL Database table.<br />
<br />
<strong><u>Step 1</u></strong><br />
<strong><u></u></strong><br />
Create a new transformation. Drag CSV file input step to the design canvas.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgibyFo8ufSM8hQVbWQ-ZA7oEng2WsX0BAM8ns7tGpYv8wINEI8nGdiIj2iy4oqGaDYCd-GWL3VnpxvI0YJkFkm6nF0Huc2V8pAl6SrqLEy8Kf1gKWc-y1-aNZlVLn0EluTU1S37t-oyGE/s1600/copy1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgibyFo8ufSM8hQVbWQ-ZA7oEng2WsX0BAM8ns7tGpYv8wINEI8nGdiIj2iy4oqGaDYCd-GWL3VnpxvI0YJkFkm6nF0Huc2V8pAl6SrqLEy8Kf1gKWc-y1-aNZlVLn0EluTU1S37t-oyGE/s1600/copy1.jpg" height="197" width="400" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<strong><u>Step 2</u></strong><br />
<strong><u></u></strong><br />
Configured CSV input step to read input data.<br />
STOCK_SYMBOL and STOCK_NAME are the input values here.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjOpJLNn3erl2UBPfkU1Q6HalRwJYMVi3hjHqhBFuhBky9MGPeaVbT9G88qsMpCa31S26fnw0tKhPAYanBR_xxfR0l9PryvCdScbSJsphfCexHNZ0wK4eARU15CMvzrw9qKDpP34j-HaBU/s1600/copy2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjOpJLNn3erl2UBPfkU1Q6HalRwJYMVi3hjHqhBFuhBky9MGPeaVbT9G88qsMpCa31S26fnw0tKhPAYanBR_xxfR0l9PryvCdScbSJsphfCexHNZ0wK4eARU15CMvzrw9qKDpP34j-HaBU/s1600/copy2.jpg" height="237" width="400" /></a></div>
<br />
<u>Preview of input test data</u><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEghdESJVkdYh6p2MzN6TgvCGIjuBN1GH4xx21cU8wFketpkmjPIAUzg5rCYEwTcIG5Wwb6Y9fq3WLmmg0sSH_rOl2zG1-E9Z_ijU7udFM6K-0xoUIvUABSYDK1YO15lumeKm-6KkRvcG9Q/s1600/copy3.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEghdESJVkdYh6p2MzN6TgvCGIjuBN1GH4xx21cU8wFketpkmjPIAUzg5rCYEwTcIG5Wwb6Y9fq3WLmmg0sSH_rOl2zG1-E9Z_ijU7udFM6K-0xoUIvUABSYDK1YO15lumeKm-6KkRvcG9Q/s1600/copy3.jpg" height="226" width="400" /></a></div>
<br />
<strong><u>Step 3</u></strong><br />
<strong><u></u></strong><br />
"Add constants" step used to generate the Quote Type parameter.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgxD9HVxDaqbxjRB0IpTE4rTQSfAHUTmBuX-7eTTw4F4RRrszItYD-awAOOQfOoIRx_UcWrNockVy77RsBGfQ3zhk-Mwhinr9Hf5Nw-2kG1wxfAMG8pcGMqs-PluJOyZmvOLTjr1ggUGRo/s1600/copy4.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgxD9HVxDaqbxjRB0IpTE4rTQSfAHUTmBuX-7eTTw4F4RRrszItYD-awAOOQfOoIRx_UcWrNockVy77RsBGfQ3zhk-Mwhinr9Hf5Nw-2kG1wxfAMG8pcGMqs-PluJOyZmvOLTjr1ggUGRo/s1600/copy4.jpg" height="156" width="400" /></a></div>
<br />
Parameter value "oabl1hgvkj" used here for demo purpose.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjZUU_Gu3-OI6llQm_Yjhxmx8Jw3VkBUvpw1NK2jXR7y6ptCxq2SVUQ6VZvSOiyjuopQepjkQ5992TPhus3QcEzkFLJcV-CNq-5d-tvH5c5KgSVMrOagrhTx6e1_akxio9wgZa6BIA3_r0/s1600/copy5.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjZUU_Gu3-OI6llQm_Yjhxmx8Jw3VkBUvpw1NK2jXR7y6ptCxq2SVUQ6VZvSOiyjuopQepjkQ5992TPhus3QcEzkFLJcV-CNq-5d-tvH5c5KgSVMrOagrhTx6e1_akxio9wgZa6BIA3_r0/s1600/copy5.jpg" height="163" width="640" /></a></div>
<br />
<strong><u>Step 4</u></strong><br />
<strong><u></u></strong><br />
Configured HTTP Client step to call Yahoo Finance API.<br />
The service will return comma separated values based on the parameters passed.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEilgWGmvU0Brfj7eV-RmTuLG2PMxges9cqnkTiK3QofOnXB_VeViMGM_NwyyBp_LAzH6gfYyrG2KwbtO3ypV-iWma6iKJsqxWTcyS_7nboHac7fYEQAojEFRbEcrxqmAp6gTKochPjHr74/s1600/copy6.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEilgWGmvU0Brfj7eV-RmTuLG2PMxges9cqnkTiK3QofOnXB_VeViMGM_NwyyBp_LAzH6gfYyrG2KwbtO3ypV-iWma6iKJsqxWTcyS_7nboHac7fYEQAojEFRbEcrxqmAp6gTKochPjHr74/s1600/copy6.jpg" height="125" width="400" /></a></div>
<br />
URL set to <a href="http://finance.yahoo.com/d/quotes.csv">http://finance.yahoo.com/d/quotes.csv</a><br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEimGuw6YgXBWcO5ArRdIyREpCqN8hjXseD_2-WpYlHvVGzh6O5qbRkFV0rJJlpgsYKuQZC5pabwrYTjdqzR40D6KCsdshqoWtndAAWhvnFKuYL_Q0LkhpemRYkG9Kg01ht14CsYQnpaKnI/s1600/copy7.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEimGuw6YgXBWcO5ArRdIyREpCqN8hjXseD_2-WpYlHvVGzh6O5qbRkFV0rJJlpgsYKuQZC5pabwrYTjdqzR40D6KCsdshqoWtndAAWhvnFKuYL_Q0LkhpemRYkG9Kg01ht14CsYQnpaKnI/s1600/copy7.jpg" height="217" width="400" /></a><br />
<br />
Configured Stock Symbol : "s" and Quote Type : "f" parameters.<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhuxRPGSpoy5bgdf0RNkgQ6okjntJzOJ9-h68rEig66chvQkGpd-NtO4J5WqGqwWaKit3nGRbEtYC1sexEyshxAaMap2ulu1tN0AXaQQ165Txfb5LQlSDfldqM9AEYvY5n4iucYADUejYg/s1600/copy8.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhuxRPGSpoy5bgdf0RNkgQ6okjntJzOJ9-h68rEig66chvQkGpd-NtO4J5WqGqwWaKit3nGRbEtYC1sexEyshxAaMap2ulu1tN0AXaQQ165Txfb5LQlSDfldqM9AEYvY5n4iucYADUejYg/s1600/copy8.jpg" height="151" width="400" /></a><br />
<br />
<strong><u>Step 5</u></strong><br />
<strong><u></u></strong><br />
Split Fields step used to convert coma separated "result" field into multiple columns.<br />
Column names are based on the quote type parameter. <br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgP9DnoviznIha3jkgh3NoqqdEl8y4cjn-oaLSO_gGBTPaKX3m0k_wkzGg9P84hox0UtoszumNRvmwRYyPNZjakd12KThDe3xleKxU1igftaLJUpFwWA3W-qNcMGS_LK0vxSXGocWjAB1c/s1600/copy9.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgP9DnoviznIha3jkgh3NoqqdEl8y4cjn-oaLSO_gGBTPaKX3m0k_wkzGg9P84hox0UtoszumNRvmwRYyPNZjakd12KThDe3xleKxU1igftaLJUpFwWA3W-qNcMGS_LK0vxSXGocWjAB1c/s1600/copy9.jpg" height="101" width="400" /></a><br />
<br />
Here are the list of columns for quote type "oabl1hgvkj".<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEicuBNjRF-R9Ft9zkPJ0GloiNF3EPY6gIW_3XILel_PFCJlVrSGzMd1tNq7C5BSQ-WNYJnKOR7523lQe5edovG82HzujXeTgFVFhJfOUkT8iFXFo5H_CniwWU3nbS3iXsRL9S0zP6dnpzU/s1600/copy10.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEicuBNjRF-R9Ft9zkPJ0GloiNF3EPY6gIW_3XILel_PFCJlVrSGzMd1tNq7C5BSQ-WNYJnKOR7523lQe5edovG82HzujXeTgFVFhJfOUkT8iFXFo5H_CniwWU3nbS3iXsRL9S0zP6dnpzU/s1600/copy10.jpg" height="206" width="400" /></a></div>
<br />
<strong><u>Step 6</u></strong><br />
<strong><u></u></strong><br />
Populate Run date and time using Get System Info.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjF98QShs_b1Ywxs19AW-uAHnz_ZvnJAkaP8XbSwAlEXS40wJINy17OaE8YdG6TuISb3xEjU4gOnEi7Bou6XbchaCdXiF-TPh8xquJArB6k8YNA9oEkNCs2XsFCOhBHMmC5rcl2NCfuaug/s1600/copy11.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjF98QShs_b1Ywxs19AW-uAHnz_ZvnJAkaP8XbSwAlEXS40wJINy17OaE8YdG6TuISb3xEjU4gOnEi7Bou6XbchaCdXiF-TPh8xquJArB6k8YNA9oEkNCs2XsFCOhBHMmC5rcl2NCfuaug/s1600/copy11.jpg" height="145" width="640" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<strong><u>Step 7</u></strong><br />
<strong><u></u></strong><br />
MySQL database table used here to store output data.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgL-KnYWPPJl8TS7-9xjtKPowWqbjX_to48yBRQ3wtUBfzFSaf1Qe5OUu7q7y4Qm_uudFCmukJa66Ywhuxm-8ZJPFe3SEbPpoaLGmKpcQx8VzdGWnODQFXn3OItitOkyjSw16_tQ_KS6sY/s1600/copy13.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgL-KnYWPPJl8TS7-9xjtKPowWqbjX_to48yBRQ3wtUBfzFSaf1Qe5OUu7q7y4Qm_uudFCmukJa66Ywhuxm-8ZJPFe3SEbPpoaLGmKpcQx8VzdGWnODQFXn3OItitOkyjSw16_tQ_KS6sY/s1600/copy13.jpg" height="246" width="640" /></a></div>
<br />
<strong><u>Step 8</u></strong><br />
<strong><u></u></strong><br />
Sample output data after couple of test runs.<br />
<div class="separator" style="clear: both; text-align: left;">
<img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiaj5svtRFCY_sVCaTGJmJ5-loI4G2f7NrJ9skJPUlOYuPRGJHDkJTb5dq5x_dnc98HPdD0rhR99ucXMoTcxhqDjPb7LFAPIsZlaSe5NTvdhkkbGQxxMCf0J14EuqMRQPFmFmXOjda7TKY/s1600/copy14.jpg" /></div>
Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com3tag:blogger.com,1999:blog-7135267265798115356.post-23510471180704585602014-01-20T13:45:00.001-08:002014-02-20T13:39:23.049-08:00Pentaho Data Integration 5.0.2 - Configure DI server ( Linux )Basic configuration steps for Pentaho Data Integration server given below. PDI installation using installation wizard on Linux OS is used for demo purposes. The server was installed on an included Apache Tomcat server.<br />
<div>
<br /></div>
<div>
<b><u>Step 1 : Start DI Server</u></b></div>
<div>
<b><u><br /></u></b></div>
<div>
Script "ctlscript.sh" can be used to manage the DI server. Here are the available script arguments.</div>
<div>
<a name='more'></a><div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj3twrrLSIKFF6SUPY4JSe1fsxDC3qVTIWn11nCZS2CVdovpR05P9UuaQ27Y1xU07OiUZPM5Gps2W2TOmVS7g6FcIg_1j6o9CBjvKoUjAuzu4f_JeffghhQgRsWLLTnv-75PsMHV4mI-Hk/s1600/pent11.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj3twrrLSIKFF6SUPY4JSe1fsxDC3qVTIWn11nCZS2CVdovpR05P9UuaQ27Y1xU07OiUZPM5Gps2W2TOmVS7g6FcIg_1j6o9CBjvKoUjAuzu4f_JeffghhQgRsWLLTnv-75PsMHV4mI-Hk/s1600/pent11.jpg" height="1" width="1" /></a></div>
<br /></div>
<div>
<div>
• start</div>
<div>
• stop</div>
<div>
• restart</div>
<div>
• status</div>
<div>
• help</div>
</div>
<div>
<br /></div>
<div>
<u>Starting the DI server </u></div>
<div>
<br /></div>
<div>
<div>
<i>[root@localhost pentaho]#<b> <span style="background-color: yellow;">/opt/pentaho/ctlscript.sh start data-integration-server</span></b></i></div>
<div>
<i>Using CATALINA_BASE: /opt/pentaho/server/data-integration-server/tomcat</i></div>
<div>
<i>Using CATALINA_HOME: /opt/pentaho/server/data-integration-server/tomcat</i></div>
<div>
<i>Using CATALINA_TMPDIR: /opt/pentaho/server/data-integration-server/tomcat/temp</i></div>
<div>
<i>Using JRE_HOME: /opt/pentaho/java</i></div>
<div>
<i>Using CLASSPATH: /opt/pentaho/server/data-integration-server/tomcat/bin/bootstrap.jar</i></div>
<div>
<i>/opt/pentaho/server/data-integration-server/scripts/ctl.sh : data-integration-server started</i></div>
<div>
<i>[root@localhost pentaho]# </i></div>
</div>
<div>
<i><br /></i></div>
<div>
<b><u>Step 2 : Start Spoon</u></b></div>
<div>
<b><u><br /></u></b></div>
<div>
Use script "spoon.sh" to start spoon on Linux OS.</div>
<div>
<br /></div>
<div>
<div>
<i>[root@localhost data-integration]# <b style="background-color: yellow;">/opt/pentaho/design-tools/data-integration/spoon.sh</b></i></div>
<div>
<i>/opt/pentaho/design-tools/data-integration</i></div>
<div>
<i>/opt/pentaho/design-tools/data-integration</i></div>
<div>
<i>DEBUG: Found JAVA two folders up</i></div>
<div>
<i>DEBUG: Found Pentaho License two folders up</i></div>
<div>
<i>DEBUG: _PENTAHO_JAVA_HOME=/opt/pentaho/design-tools/data-integration/../../java</i></div>
<div>
<i>DEBUG: _PENTAHO_JAVA=/opt/pentaho/design-tools/data-integration/../../java/bin/java</i></div>
<div>
<i>DEBUG: PENTAHO_INSTALLED_LICENSE_PATH=/opt/pentaho/design-tools/data-integration/../../.installedLicenses.xml</i></div>
</div>
<div>
<i><br /></i></div>
<div>
<b><u>Step 3 :</u></b></div>
<div>
<b><u><br /></u></b></div>
<div>
Connect to Pentaho Data Integration repository via option "Tools >> Repository >> Connect".</div>
<div>
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhs-mr0aGPSPdUdHJ172ZI0dxlf6NoR5Aiku0R6jega8IW0N0rZbA1BcOPPMUtD_kyU6JYQPnieYks_gFtvxuEXP3_XfUBzXp7n5KbSUczmoZIoowmxYXVt1BvMLjtCbHmH23SQ0PjmUUI/s1600/copy1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhs-mr0aGPSPdUdHJ172ZI0dxlf6NoR5Aiku0R6jega8IW0N0rZbA1BcOPPMUtD_kyU6JYQPnieYks_gFtvxuEXP3_XfUBzXp7n5KbSUczmoZIoowmxYXVt1BvMLjtCbHmH23SQ0PjmUUI/s1600/copy1.jpg" height="117" width="400" /></a></div>
<br />
<div>
<b><u>Step 4 :</u></b></div>
<div>
<b><u><br /></u></b></div>
<div>
Repository connection wizard will pop up.</div>
<div>
Use "+" button to create a new repository.</div>
<div>
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgDdVchEDh4fW1NqJTlQsaNCBVII5DgApdt6rjH0B-xVWEqJJytOZkOl1KlNJjgplGgEQoGzjoTSR74-tb3c6XMBczk49rOqin1bqOP55NRb7qqggSAUm2NUTku27ftu-4slM_NSPd8QpU/s1600/copy2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgDdVchEDh4fW1NqJTlQsaNCBVII5DgApdt6rjH0B-xVWEqJJytOZkOl1KlNJjgplGgEQoGzjoTSR74-tb3c6XMBczk49rOqin1bqOP55NRb7qqggSAUm2NUTku27ftu-4slM_NSPd8QpU/s1600/copy2.jpg" height="400" width="333" /></a></div>
<br />
<div>
<b><u>Step 5 :</u></b></div>
<div>
<b><u><br /></u></b></div>
<div>
Choose "DI Repository" as the repository type.</div>
<div>
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjzTE9wouTbxyAmonl7KIdKjHL69LNBylopQqWnPST74fH2Wz0htvvT8xVV1WuTdsxvBqV44jLHVbwatMi80uQMzJwXn3C_lAYmjvwtqGfHCPRnLA6_AuAI0P7liwxc_UBW7tysgDxq7Jc/s1600/copy3.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjzTE9wouTbxyAmonl7KIdKjHL69LNBylopQqWnPST74fH2Wz0htvvT8xVV1WuTdsxvBqV44jLHVbwatMi80uQMzJwXn3C_lAYmjvwtqGfHCPRnLA6_AuAI0P7liwxc_UBW7tysgDxq7Jc/s1600/copy3.jpg" height="128" width="400" /></a></div>
<br />
<div>
<b><u>Step 6 :</u></b></div>
<div>
<b><u><br /></u></b></div>
<div>
Repository Configuration screen will pop up. Replace default URL if needed. </div>
<div>
Enter meaningful ID and Name.</div>
<div>
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiECHjkrnqHB9_CW6YFUoRe-jtiex6KEk85qRVna0Ry7FpCbkhMIViPb9i4T7_IOkt2wNGPu7bD3n9edJntxHYWpMx3shWg0lbMEcdDZQpHbHE-deYkS0ihLkRmaGItqPtm4po1z1Gqi7o/s1600/copy4.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiECHjkrnqHB9_CW6YFUoRe-jtiex6KEk85qRVna0Ry7FpCbkhMIViPb9i4T7_IOkt2wNGPu7bD3n9edJntxHYWpMx3shWg0lbMEcdDZQpHbHE-deYkS0ihLkRmaGItqPtm4po1z1Gqi7o/s1600/copy4.jpg" height="230" width="400" /></a></div>
<br />
<div>
<b><u>Step 7 :</u></b></div>
<div>
<b><u><br /></u></b></div>
<div>
Click on "Test" button to verify repository URL.</div>
<div>
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg0SVcV-zi7Nxa2eXBE-p5WeRbzFcsU64rzC0ANusA0z7uQl1Y_O-0mt7wEj-RjtPL66xWz1_u0Yl-tytpEU8SyzvhyphenhyphenRnM1kUTN7pK4zf8Eh7V6gpzpQSySrjWUGDX1kJE01NRRmVAnZ-s/s1600/copy5.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg0SVcV-zi7Nxa2eXBE-p5WeRbzFcsU64rzC0ANusA0z7uQl1Y_O-0mt7wEj-RjtPL66xWz1_u0Yl-tytpEU8SyzvhyphenhyphenRnM1kUTN7pK4zf8Eh7V6gpzpQSySrjWUGDX1kJE01NRRmVAnZ-s/s1600/copy5.jpg" height="232" width="400" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<div>
<b><u>Step 8 :</u></b></div>
<div>
<b><u><br /></u></b></div>
<div>
Provide user credentials for the repository.</div>
<div>
<br /></div>
<div>
Default User Name : admin</div>
<div>
Default Password : password</div>
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhfbZB_F_O2T9xqFcxyj0kBuXT1qe8HNp6E_4oRjGsoBLQw23mfMFuRUk6jhLTZCQU6tOsxtYXrFl4ZC0sIIT5kyqRR8LW3fPc27PSo40EQSuQ6G2DlkmtnkVGaZC2wIkhA9s6g6xfDFxw/s1600/copy6.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhfbZB_F_O2T9xqFcxyj0kBuXT1qe8HNp6E_4oRjGsoBLQw23mfMFuRUk6jhLTZCQU6tOsxtYXrFl4ZC0sIIT5kyqRR8LW3fPc27PSo40EQSuQ6G2DlkmtnkVGaZC2wIkhA9s6g6xfDFxw/s1600/copy6.jpg" height="400" width="335" /></a></div>
<br />
<div>
<b><u>Step 9 :</u></b></div>
<div>
<b><u><br /></u></b></div>
<div>
Repository Log in - In progress.</div>
<div>
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg7G6T-CBTImgLhl6k4QzcInyuQsruq06dBhbeXHNwvhyphenhyphenVXh6ggjwONvpnqgqcUA2kHrUCwp7vdEpUf80jSfsCim7qChsNc-ApthfjK878Ai06Twi0sSfDOAPZPq7pft1aNFb0rLWnzfSQ/s1600/copy7.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg7G6T-CBTImgLhl6k4QzcInyuQsruq06dBhbeXHNwvhyphenhyphenVXh6ggjwONvpnqgqcUA2kHrUCwp7vdEpUf80jSfsCim7qChsNc-ApthfjK878Ai06Twi0sSfDOAPZPq7pft1aNFb0rLWnzfSQ/s1600/copy7.jpg" height="400" width="333" /></a></div>
<br />
<div>
<b><u>Step 10 :</u></b></div>
<div>
<b><u><br /></u></b></div>
<div>
Use option "Tools >> Repository >> Explore" to view and manage Repository.</div>
<div>
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh6BrgU79f16wxegCNL_K2kuMdDsVzpFLOkkMh8nxpLfZ_kFk-5I-3_FLMhuyqImim3W7T_VoSKl4xuQ4x9qtyYbTppefGI3_2uODLwkG4gcd66IjBDIW0HD6reTiNqo9MN2C_xAUasC-A/s1600/copy8.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh6BrgU79f16wxegCNL_K2kuMdDsVzpFLOkkMh8nxpLfZ_kFk-5I-3_FLMhuyqImim3W7T_VoSKl4xuQ4x9qtyYbTppefGI3_2uODLwkG4gcd66IjBDIW0HD6reTiNqo9MN2C_xAUasC-A/s1600/copy8.jpg" height="167" width="400" /></a></div>
<div>
<b><u><br /></u></b></div>
<div>
<br /></div>
Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com1tag:blogger.com,1999:blog-7135267265798115356.post-75184960418586237202014-01-05T17:21:00.006-08:002014-02-20T12:28:49.812-08:00Pentaho Business Analytics Enterprise Edition 5.0.2 - Installation for Linux<div class="separator" style="clear: both; text-align: left;">
Pentaho Business Analytics (BA) Suite can be installed in several ways. Install All Components method chosen for demo purpose. Both Business Analytics and Data Integration components will be installed. </div>
<div class="separator" style="clear: both; text-align: left;">
</div>
<div class="separator" style="clear: both; text-align: left;">
Here is a list of commonly used BA and DI components.</div>
<div class="separator" style="clear: both; text-align: left;">
</div>
<a name='more'></a><div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhXT7YckYJHd8NJ0owVGPOVxRPVTTBPMVRVhiigj9HqDc2WXqj0aRKyYZVn9Ya5bptrfsfzRC0ZFNaw-d63MeOvY9EavxeDpmfd8u3GVj4hYTodTGZVKpJhod5BiASr9c7Wbl8s61J-aK4/s1600/install.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhXT7YckYJHd8NJ0owVGPOVxRPVTTBPMVRVhiigj9HqDc2WXqj0aRKyYZVn9Ya5bptrfsfzRC0ZFNaw-d63MeOvY9EavxeDpmfd8u3GVj4hYTodTGZVKpJhod5BiASr9c7Wbl8s61J-aK4/s1600/install.jpg" height="1" width="1" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<br />
<div class="separator" style="clear: both; text-align: left;">
<strong><u>BA Components</u></strong></div>
<div class="separator" style="clear: both; text-align: left;">
<strong><u></u></strong> </div>
<ul>
<li><div class="separator" style="clear: both; text-align: left;">
BA Server : Pentaho host</div>
</li>
<li>User Console : Web based interface for administration, scheduling and security management.</li>
<li>Metadata Editor : Design tool to build metadata models and domains.</li>
<li>Report Designer : Design tool to develop reports.</li>
<li>Analyzer : Enable data visualization.</li>
<li>Aggregation Designer : Design tool to create aggregates.</li>
<li>Schema Workbench : Design tool to create complex multi dimensional models.</li>
</ul>
<br />
<strong><u>DI Components</u></strong><br />
<strong><u></u></strong><br />
<ul>
<li>DI Server : Host, store and manage DI components.</li>
<li>Spoon : Design tool to create jobs and transformations.</li>
<li>Kitchen : Command line tool to execute jobs.</li>
<li>Pan : Command line tool to execute transformations.</li>
<li>Carte : Web server to remote execute jobs.</li>
</ul>
<strong><u></u></strong><br />
Plz find stepwise illustration on how to install Pentaho Business Analytics version ( 5.0.2 ) on Linux.<br />
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<div class="separator" style="clear: both;">
<b><u>Step 1 :</u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Download latest Pentaho Business Analytics version ( 5.0.2 ) from http://pentaho.com/download</div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiJKCJy7_OPiCH_Fh0UFPRo99ucm6PXhQrb7thSzZOQzqnkneqC6PK4tJ4GdC6y1jZ0dDHbWRxPRFkOOwkmPGUJLBbDAamgaWa8LJ47vgdrqiCDynkjGhHt2KOnroyXrBEB1ba_8qRn59Y/s1600/copy1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiJKCJy7_OPiCH_Fh0UFPRo99ucm6PXhQrb7thSzZOQzqnkneqC6PK4tJ4GdC6y1jZ0dDHbWRxPRFkOOwkmPGUJLBbDAamgaWa8LJ47vgdrqiCDynkjGhHt2KOnroyXrBEB1ba_8qRn59Y/s400/copy1.jpg" height="310" width="400" /></a></div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<b><u>Step 2</u></b></div>
<br />
Change permissions and execute pentaho-business-analytics-5.0.2-x64.bin<br />
<br />
<b><u></u> </b>[root@localhost Downloads]# chmod 777 pentaho-business-analytics-5.0.2-x64.bin<br />
<div class="separator" style="clear: both; text-align: left;">
[root@localhost Downloads]# ./pentaho-business-analytics-5.0.2-x64.bin</div>
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both;">
<b><u>Step 3 :</u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Setup wizard will pop up. Click Next.</div>
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh7FAu9ezIj7X1bY58TSrcb603OYxE3rqcDU8PkUZ4z0xzV8Vy-a6slgTogb5VkQ34NaJJCQrP-qjG684VaT1iBdzb5-Bh-6s10iz6-Qh3Gzc-rWF63QrF8M9ERNhLgdLxq2e5J_YJlWZY/s1600/copy3.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh7FAu9ezIj7X1bY58TSrcb603OYxE3rqcDU8PkUZ4z0xzV8Vy-a6slgTogb5VkQ34NaJJCQrP-qjG684VaT1iBdzb5-Bh-6s10iz6-Qh3Gzc-rWF63QrF8M9ERNhLgdLxq2e5J_YJlWZY/s400/copy3.jpg" height="331" width="400" /></a></div>
<br />
<div class="separator" style="clear: both;">
<b><u>Step 4 :</u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Read and Accept license agreements. Click Next.</div>
<div class="separator" style="clear: both;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgqCq6V_iVBicLVkFCMB14c4eJDjUZ-lPMrXLbFDfM1S6Gt1gB67RKFn2D2sIb1J6o_rCTtBIKcSlLGcwN57_Wry89oBUBYGKPMYUHfywT_8qDuc4ts5z1L-8xJGJTSOCEulJ_hvX1C8WU/s1600/copy4.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgqCq6V_iVBicLVkFCMB14c4eJDjUZ-lPMrXLbFDfM1S6Gt1gB67RKFn2D2sIb1J6o_rCTtBIKcSlLGcwN57_Wry89oBUBYGKPMYUHfywT_8qDuc4ts5z1L-8xJGJTSOCEulJ_hvX1C8WU/s400/copy4.jpg" height="301" width="400" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
</div>
<div class="separator" style="clear: both;">
<b><u>Step 5 :</u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Choose setup type based on requirements and needs.</div>
<div class="separator" style="clear: both;">
Default setup chosen for demo purpose.</div>
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh4m7p80wRialhgTehfG9bjn4gmiA8jPSv0EHEhZADZ70XzbpFQVLfsSh2N9TGVPmV6o2P8U6mv1Pzb8mm-EzdHZu2FM02ibW0HIPU_8uBv3Vjnz673CsD8vBl6KqogXuXTXe4OGAslVls/s1600/copy5.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh4m7p80wRialhgTehfG9bjn4gmiA8jPSv0EHEhZADZ70XzbpFQVLfsSh2N9TGVPmV6o2P8U6mv1Pzb8mm-EzdHZu2FM02ibW0HIPU_8uBv3Vjnz673CsD8vBl6KqogXuXTXe4OGAslVls/s400/copy5.jpg" height="300" width="400" /></a></div>
<br />
<div class="separator" style="clear: both;">
<b><u>Step 6 :</u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Specify installation location. Click Next.</div>
<div class="separator" style="clear: both;">
/opt/pentaho is the default location. </div>
<div class="separator" style="clear: both;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhMLwPe9dllWPtxgoqy1jLc0dLpmpr6-gKEcFtFCzC3Sc6IiUzwu2VxjgJu0rOph1vgQmqzoRNMLJw-t1-Ekd5f22zaQnGBXGCrdACtPjrMfcf2ETGxbpSKfq9Wm7ceXND7mIIYK6IkWuA/s1600/copy6.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhMLwPe9dllWPtxgoqy1jLc0dLpmpr6-gKEcFtFCzC3Sc6IiUzwu2VxjgJu0rOph1vgQmqzoRNMLJw-t1-Ekd5f22zaQnGBXGCrdACtPjrMfcf2ETGxbpSKfq9Wm7ceXND7mIIYK6IkWuA/s400/copy6.jpg" height="300" width="400" /></a></div>
<br />
<div class="separator" style="clear: both;">
<b><u>Step 7 :</u></b></div>
<div class="separator" style="clear: both;">
</div>
<div class="separator" style="clear: both;">
Configure password for PostgreSQL. Click Next.</div>
<div class="separator" style="clear: both;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhg0Ry_3tMKmlE_iLidVq6p5vYsn4w69iK8zB_XL2VpMhuc6_eH2_n7MccRCxIsRIcW9COJhAUdkuVyz1HcPonWB_hqTNxHmmvb5c-dfOHn8gyrv0X-PAPuboScOqOqOTRMquURg6tE9Ls/s1600/copy7.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhg0Ry_3tMKmlE_iLidVq6p5vYsn4w69iK8zB_XL2VpMhuc6_eH2_n7MccRCxIsRIcW9COJhAUdkuVyz1HcPonWB_hqTNxHmmvb5c-dfOHn8gyrv0X-PAPuboScOqOqOTRMquURg6tE9Ls/s400/copy7.jpg" height="303" width="400" /></a></div>
<br />
<div class="separator" style="clear: both;">
<b><u>Step 8 :</u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Setup ready to install. Click Next.</div>
<div class="separator" style="clear: both; text-align: center;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg-pUxJJ6PB9LsWNpOnBzPp_K6k4pl_kDIkPEbIaQbu0PkIy0jaKxrs6wUiwVuUF10KeQJqKkjAfnSIFZto7JXEY5ZtC2B7DTEg9p4b9qBi1ch7yWp5jt3lVhw2aEY-WVEDQmMqdtESc9o/s1600/copy8.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg-pUxJJ6PB9LsWNpOnBzPp_K6k4pl_kDIkPEbIaQbu0PkIy0jaKxrs6wUiwVuUF10KeQJqKkjAfnSIFZto7JXEY5ZtC2B7DTEg9p4b9qBi1ch7yWp5jt3lVhw2aEY-WVEDQmMqdtESc9o/s400/copy8.jpg" height="300" width="400" /></a></div>
<br />
<div class="separator" style="clear: both;">
<b><u>Step 9 :</u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Updating license files.</div>
<div class="separator" style="clear: both;">
Trial version used for demo purpose. License files updated automatically for trial versions.</div>
<div class="separator" style="clear: both;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj5Woj_WxnqTijh5ozPfn-2XOFI9tq5ppeXKo8AHLcHbIti20_fwe52uQ8fDpZAAC0ymA4mXpcqdCvKOxGkfiFa0E5i2J87OuRkpNygO4BXyKqbYqefebinXSgubr3rIWjK-5no94GS33o/s1600/copy9.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj5Woj_WxnqTijh5ozPfn-2XOFI9tq5ppeXKo8AHLcHbIti20_fwe52uQ8fDpZAAC0ymA4mXpcqdCvKOxGkfiFa0E5i2J87OuRkpNygO4BXyKqbYqefebinXSgubr3rIWjK-5no94GS33o/s400/copy9.jpg" height="298" width="400" /></a></div>
<br />
<div class="separator" style="clear: both;">
<b><u>Step 10 :</u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Installation in progress.</div>
<div class="separator" style="clear: both;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjy9-fG892_uR1fvV-hy4L37gVAoRNgpEuqNzHK3ljswQa_62MN7aZulSEoUpPzWD4cUr5tIgqsbIyLKMdNtVyMAL9s-PXXrlhUbdE7EquvhQ2hx9iltwEt86TRi6BkiKsD4X3JIzEcp_c/s1600/copy10.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjy9-fG892_uR1fvV-hy4L37gVAoRNgpEuqNzHK3ljswQa_62MN7aZulSEoUpPzWD4cUr5tIgqsbIyLKMdNtVyMAL9s-PXXrlhUbdE7EquvhQ2hx9iltwEt86TRi6BkiKsD4X3JIzEcp_c/s400/copy10.jpg" height="300" width="400" /></a></div>
<br />
<div class="separator" style="clear: both;">
<b><u>Step 11 :</u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Starting BA server.</div>
<div class="separator" style="clear: both;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh4grc4h05Hvhzk2EPTVOzgfZ3ODW4GjrnqoKPZc9XGAjmZjGKtBlpIJ03S-HwbTmFKpeu5_sIzKUJYkUkm0rDsEvd-OnLHFJoSApZ-KuDKzmqwbXqaoiZZkFtRpifSlvh6YdN2TfY_nSU/s1600/copy11.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh4grc4h05Hvhzk2EPTVOzgfZ3ODW4GjrnqoKPZc9XGAjmZjGKtBlpIJ03S-HwbTmFKpeu5_sIzKUJYkUkm0rDsEvd-OnLHFJoSApZ-KuDKzmqwbXqaoiZZkFtRpifSlvh6YdN2TfY_nSU/s400/copy11.jpg" height="302" width="400" /></a></div>
<br />
<div class="separator" style="clear: both;">
<b><u>Step 12 :</u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Installation Complete.</div>
<div class="separator" style="clear: both;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhpeZrhrWEQv7YtcrIMFEAsmB0kGJEbln75nTokRflBmAV4Gv_2ve3RzjVhoy_HleueYrLUEHXEwGyV_h7SWp-BwKFJ8slSGKh1u5jIZmBjcrM3sv5TKkKop1jMBgSE_FIfwSi0VXlQPqQ/s1600/copy12.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhpeZrhrWEQv7YtcrIMFEAsmB0kGJEbln75nTokRflBmAV4Gv_2ve3RzjVhoy_HleueYrLUEHXEwGyV_h7SWp-BwKFJ8slSGKh1u5jIZmBjcrM3sv5TKkKop1jMBgSE_FIfwSi0VXlQPqQ/s400/copy12.jpg" height="300" width="400" /></a></div>
<br />
<div>
<br /></div>
<div>
<b><u>Installation Summary</u></b></div>
<div>
<b><u></u></b><br />
<div>
</div>
</div>
<div>
<div>
Selections/Settings:</div>
<div>
<br /></div>
<div>
- PostgreSQL server port: 5432</div>
<div>
- HSQL server port: 9001</div>
<div>
- BA Server (Tomcat) startup port: 8080</div>
<div>
- BA Server shutdown port: 8012</div>
<div>
- Data Integration Server port: 9080</div>
<div>
- Monet Database Port: 50006 </div>
<div>
- Embedded Database Port: 9092</div>
<div>
<br /></div>
<div>
- Reporting Design Tools:</div>
<div>
- Report Designer</div>
<div>
- Metadata Editor</div>
<div>
<br /></div>
<div>
- Analysis Design Tools:</div>
<div>
- Schema Workbench</div>
<div>
- Aggregate Table Designer</div>
<div>
<br /></div>
<div>
- Dashboard Designer</div>
<div>
<br /></div>
<div>
- Data Integration (ETL)</div>
<div>
<br /></div>
<div>
- Analyzer</div>
<div>
<br /></div>
<div>
- Interactive Reporting</div>
<div>
<br /></div>
<div>
- Mobile</div>
<div>
<br /></div>
<div>
- Sample Content</div>
</div>
<div>
<br /></div>
Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com5tag:blogger.com,1999:blog-7135267265798115356.post-32655715773938512572014-01-03T20:05:00.000-08:002014-02-20T13:41:24.111-08:00Pentaho Data Integration : Microsoft Excel InputMicrosoft Excel Input step can be used to integrate data from various Excel sources including open office work books. This step can extract data from Excel 97-2003 ( xls ) files or Excel 2007 ( xlsx ) files. Plz find stepwise illustration below on how to configure Pentaho Excel Input Step.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg-YryZp0FbKL61Tx2SELjMZf2bGwnj1Mivy56Efquay-j2penD5piZxuUPmZMgAYgzB0Uf1zXagklvIqdohD8J5l4Xiu8K1jAgLpUethx-CkTfp3i5i4jclbOGhL9gaPX9lf1WNvHb-o4/s1600/copy6.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"></a> </div>
<strong><u></u></strong><br />
<strong><u>Step 1</u></strong><br />
<strong><u></u></strong><br />
Drag and drop Microsoft Excel Input Step into the Transformation design canvas.<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg3TI-uL3X4Lk_GJMwoDBVcFjjX-g2W4iUciEg0X9HJAZsPlUuQ5KfOAQA0MAD-uOyJQsIcic4Xrm_u55FK1S4DeAZDLbU4vczgb5obsQbcedEPoT37X-P1VdlK-hJ54ij4bO8u4tKMdng/s1600/copy1.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg3TI-uL3X4Lk_GJMwoDBVcFjjX-g2W4iUciEg0X9HJAZsPlUuQ5KfOAQA0MAD-uOyJQsIcic4Xrm_u55FK1S4DeAZDLbU4vczgb5obsQbcedEPoT37X-P1VdlK-hJ54ij4bO8u4tKMdng/s400/copy1.png" height="146" width="400" /></a><br />
<a name='more'></a><div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgNUjoRHwl7BVgrKax4BPUFDgIh1Iv_HfpmRS8TGeGdfEn0o5etznxxb93d-VCjbgtzIuFlbuxG9gU7nKqEEuV_C7BRQZWiLSNSCMB0QUhegzMTGSQOTtQZmrAPtQQt1IEEvvE_q-vAqyg/s1600/xcel.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgNUjoRHwl7BVgrKax4BPUFDgIh1Iv_HfpmRS8TGeGdfEn0o5etznxxb93d-VCjbgtzIuFlbuxG9gU7nKqEEuV_C7BRQZWiLSNSCMB0QUhegzMTGSQOTtQZmrAPtQQt1IEEvvE_q-vAqyg/s1600/xcel.jpg" height="1" width="1" /></a></div>
<br />
<strong><u>Step 2</u></strong><br />
<strong><u></u></strong><br />
Click on Browse to select local Excel file with sample data.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj7s2gxg14PbYn3s4Sv61erXlxKXs-bx7F4PbVcY1FHEJhwUl3AOizP-PR8LZSD-NI6HhI0YALLdmNOOZkAdH90_im_Y1jYfMD4BHZ3KAmaclVn-2lyc_v_uf1SafqWHLzmTBeEBgkRMnA/s1600/copy2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj7s2gxg14PbYn3s4Sv61erXlxKXs-bx7F4PbVcY1FHEJhwUl3AOizP-PR8LZSD-NI6HhI0YALLdmNOOZkAdH90_im_Y1jYfMD4BHZ3KAmaclVn-2lyc_v_uf1SafqWHLzmTBeEBgkRMnA/s400/copy2.png" height="233" width="400" /></a></div>
<br />
<strong><u>Step 3</u></strong><br />
<strong><u></u></strong><br />
Add the selected file.<br />
Also there are options available to add list of files or to specify wild cards.<br />
File names can be retrieved from previous steps using "Accept Filenames from previous steps" option.<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg-YryZp0FbKL61Tx2SELjMZf2bGwnj1Mivy56Efquay-j2penD5piZxuUPmZMgAYgzB0Uf1zXagklvIqdohD8J5l4Xiu8K1jAgLpUethx-CkTfp3i5i4jclbOGhL9gaPX9lf1WNvHb-o4/s1600/copy6.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"></a><br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh4rL7E-RXl_gxXWhbfFCB1WRw9seiRwlA2HFtQBvSs-B2C-Cb8wxObwkFwpiwUZ5EtAb01LQrsCfP3lJmK88q1y2jMs7-2gIj9UuykuoqMWKZYU_9j9QrhIpir9EG3SMn7ZzgUamL43wI/s1600/copy3.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh4rL7E-RXl_gxXWhbfFCB1WRw9seiRwlA2HFtQBvSs-B2C-Cb8wxObwkFwpiwUZ5EtAb01LQrsCfP3lJmK88q1y2jMs7-2gIj9UuykuoqMWKZYU_9j9QrhIpir9EG3SMn7ZzgUamL43wI/s400/copy3.png" height="210" width="400" /></a><strong><u></u></strong></div>
<div class="separator" style="clear: both; text-align: left;">
<strong><u></u></strong> </div>
<div class="separator" style="clear: both; text-align: left;">
<strong><u>Step 4</u></strong></div>
<strong><u></u></strong><br />
Click on the Sheets tab to select list of sheets to read from.<br />
<span style="-webkit-text-stroke-width: 0px; color: #2d393b; display: inline !important; float: none; font-size-adjust: none; font-stretch: normal; font: 14px/16px Arial, Helvetica, sans-serif; letter-spacing: normal; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px;">"Get sheetnames" button will retrieve a list of available worksheets.</span><br />
<span style="-webkit-text-stroke-width: 0px; color: #2d393b; display: inline !important; float: none; font-size-adjust: none; font-stretch: normal; font: 14px/16px Arial, Helvetica, sans-serif; letter-spacing: normal; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px;">Select from the available items.</span><br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjP7mK6r-4EDQ2jyqXMFe3Afgk1VbP1UsWjgh9W9RQ6s286QrTauA_JfDdF6i4Dn1yR6GB9euk8KjBJBuiY-1IKa3WYlUNfkhlvHpU0S9RYNh6M1hX7Uv7JTI9KN_zUgHmBG99k3RH5dkw/s1600/copy4.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjP7mK6r-4EDQ2jyqXMFe3Afgk1VbP1UsWjgh9W9RQ6s286QrTauA_JfDdF6i4Dn1yR6GB9euk8KjBJBuiY-1IKa3WYlUNfkhlvHpU0S9RYNh6M1hX7Uv7JTI9KN_zUgHmBG99k3RH5dkw/s400/copy4.png" height="258" width="400" /></a><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiiVEI73N43CkOmf1uck5-nEzrvsCzBF_irb88GrCcwSwij4bSl0rWzE9ir1K7triB2o6gSdIBmwbmgDEUIcm69vt9ZwZk7p4JgfdrCl9PqZLL0W5QAtUQlItl0rtGpuLgH0HbZ3Y05Ohg/s1600/copy8.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"></a> </div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<strong><u>Step 5</u></strong><br />
<strong><u></u></strong><br />
Click on content tab to manage content properties.<br />
Following are some of the commonly used options.<br />
<br />
Header - First row will be skipped as header row.<br />
No Empty Rows - Empty rows will be skipped.<br />
Stop on Empty Row - Step will stop in case of any empty row.<br />
Limit - Limit the number of rows. Enter 0 to process all rows.<br />
<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg-YryZp0FbKL61Tx2SELjMZf2bGwnj1Mivy56Efquay-j2penD5piZxuUPmZMgAYgzB0Uf1zXagklvIqdohD8J5l4Xiu8K1jAgLpUethx-CkTfp3i5i4jclbOGhL9gaPX9lf1WNvHb-o4/s1600/copy6.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEg-YryZp0FbKL61Tx2SELjMZf2bGwnj1Mivy56Efquay-j2penD5piZxuUPmZMgAYgzB0Uf1zXagklvIqdohD8J5l4Xiu8K1jAgLpUethx-CkTfp3i5i4jclbOGhL9gaPX9lf1WNvHb-o4/s400/copy6.png" height="256" width="400" /></a><br />
<br />
<strong><u>Step 6</u></strong><br />
<strong><u></u></strong><br />
Click on Fields tab to generate and configure fields.<br />
Use "Get fields from header row" button to export field names and attributes from sample data file sheet header row. Name and other attributes can be modified based on the requirements.<br />
Following are some of the commonly used options.<br />
<br />
Name - Field name.<br />
Type - Data type.<br />
Trim Type - Support left, right and both trim.<br />
Repeat - Default is N. Set to Y to repeat the value in case next field is empty.<br />
<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEixYdSQIGaYd-bOhoNz5Dr2Sr_oObt30ZlVBx78zPcG8SrsBbp9NU5zZkTVylTrha2DO4DHG1mP2eH0t76QultG5UiWAyAAdCVWSVDF4vegZdDlEkv5I7oqkrepHYVj9KNHnRSFy-2e80I/s1600/copy7.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEixYdSQIGaYd-bOhoNz5Dr2Sr_oObt30ZlVBx78zPcG8SrsBbp9NU5zZkTVylTrha2DO4DHG1mP2eH0t76QultG5UiWAyAAdCVWSVDF4vegZdDlEkv5I7oqkrepHYVj9KNHnRSFy-2e80I/s400/copy7.png" height="260" width="400" /></a><br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhhFZqK05Y9t6mBIYbi8fb2mMV9a5LdX-j18-ZiIaNzkLnYqW4wJ3pHjVwtbkooZkpbL-BtvPwLwC5qXLZ60GyF69JMTwC3Em9Q8RVOrtiu2KI3Q0Sr1hkPO6QAx5geiDoUXM4qysLKzQM/s1600/copy5.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"></a> </div>
<strong><u>Step 7</u></strong><br />
<strong><u></u></strong><br />
Additional output fields can be configured if needed.<br />
<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEioMuHLo6-Gpr235UEY6xrR_GsKpLlmgbYSslBXE-wrngXLTXNHh1qXCsW_EgaMeHuPETFTDhVgH8dQpDHcmbN0h_HIqPPjNh13PEK-NU2pRiG3E86fwqflZ0JablEMn5HRZkiC35-5IMo/s1600/copy6.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"></a><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgzqBCSLuAxFLm9b2qTL7xKIgUcKQ40MRyo72AurAMpYIIrYuyrexuCgNpqIK_njJrdEVF1dvFOmZAhwFR3ciSV1ye98h7te5gnGrR5LZQVKxQJTUi-QzkR3__Ei66FiOQd9HguXvXeW5k/s1600/copy8.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgzqBCSLuAxFLm9b2qTL7xKIgUcKQ40MRyo72AurAMpYIIrYuyrexuCgNpqIK_njJrdEVF1dvFOmZAhwFR3ciSV1ye98h7te5gnGrR5LZQVKxQJTUi-QzkR3__Ei66FiOQd9HguXvXeW5k/s400/copy8.png" height="260" width="400" /></a> </div>
<br />
<strong><u>Step 8</u></strong><br />
<strong><u></u></strong><br />
Click on "Preview rows" button to examine the data.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjF4E9-QQR0viM45ub64jqUauSUHLMfmlTubT3vhF-VgsI455lblj86AClZWSVHPLJbxLuPDesfOF9SuKm0uHCBWSmNbOCcenND6GCAJ1ircBNK4q966Ev_1t-xiJ4THgyJgDNASAvB45g/s1600/copy7.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"></a> </div>
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjKUE5RAnsvOIJyNKLZCU_kNEiaSm8it-8wR5BXfNVqsyPvtdBNHPQa8Y8F9CnrHUYnxFZ9JcoSuqsdO8fyWkwDm4L8xDbT0jy283jlqCjk19LEEZnjW1qzCZ_bdWIzfTh97ly-062DnWg/s1600/copy9.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjKUE5RAnsvOIJyNKLZCU_kNEiaSm8it-8wR5BXfNVqsyPvtdBNHPQa8Y8F9CnrHUYnxFZ9JcoSuqsdO8fyWkwDm4L8xDbT0jy283jlqCjk19LEEZnjW1qzCZ_bdWIzfTh97ly-062DnWg/s640/copy9.png" height="155" width="640" /></a>Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com3tag:blogger.com,1999:blog-7135267265798115356.post-8475301708611849912013-12-25T19:02:00.001-08:002014-02-20T14:02:20.927-08:00Pentaho Data Integration - Configure Oracle JDBC ConnectionStepwise illustration on how to configure native JDBC Oracle database connections for Pentaho Data Integration. JDBC connections are the easiest and most commonly used access protocol. Connections can be configured using Spoon and managed by DI server.<br />
<br />
<strong><u>Step 1 :</u></strong><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjIGnxblTsTh8MIAs324lK9cstYEiMfwTnJRNtoS7Z24IcgtrBiYomioo4sXCefPOSDd2ECYisV6dLNe2U7pja0MxH5eVNecx-NtMlgC9ejju6eKzK5YQHmNDJ_DCJMJHgcIeAbxVpXEos/s1600/chain.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjIGnxblTsTh8MIAs324lK9cstYEiMfwTnJRNtoS7Z24IcgtrBiYomioo4sXCefPOSDd2ECYisV6dLNe2U7pja0MxH5eVNecx-NtMlgC9ejju6eKzK5YQHmNDJ_DCJMJHgcIeAbxVpXEos/s1600/chain.jpg" height="1" width="1" /></a></div>
<div class="separator" style="clear: both; text-align: center;">
</div>
<div class="separator" style="clear: both; text-align: left;">
Open Spoon. </div>
Go to "Database Connection >> New Connection Wizard"<br />
<a name='more'></a><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgqdgjGPQGdG6oa1xCqoDvP2NwT2Wjjnpq3Qom3kL8s6_wmLB2txBcH0io9mHmm6clQElIkLl4U2GYg0RaNbL05OYuAUvTnyUGZFpptTWDmA3ADiDve8ddWW8n1pha6T5gwscRHQ6Lisqw/s1600/copy1.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><strong><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgqdgjGPQGdG6oa1xCqoDvP2NwT2Wjjnpq3Qom3kL8s6_wmLB2txBcH0io9mHmm6clQElIkLl4U2GYg0RaNbL05OYuAUvTnyUGZFpptTWDmA3ADiDve8ddWW8n1pha6T5gwscRHQ6Lisqw/s400/copy1.png" height="202" width="400" /></strong></a></div>
<strong><u></u></strong><br />
<strong><u>Step 2 :</u></strong><br />
<br />
Connection wizard will pop up.<br />
Select database type as Oracle and access type as JDBC.<br />
Click Next.<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEigwM2-k_vcLjHJrFLP51QlhozEl3qwTFgfi3WQhau4ttb_fBshSqVAncLcEIJL6d_UOpyq8GRm7R4qNxAiM4WoEHs7ZM_9tmXWClVTvbhzohuMH18NC2zz2FJTvAq7b8kGlBn_TyKmop0/s1600/copy2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><strong><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEigwM2-k_vcLjHJrFLP51QlhozEl3qwTFgfi3WQhau4ttb_fBshSqVAncLcEIJL6d_UOpyq8GRm7R4qNxAiM4WoEHs7ZM_9tmXWClVTvbhzohuMH18NC2zz2FJTvAq7b8kGlBn_TyKmop0/s400/copy2.png" height="398" width="400" /></strong></a><br />
<strong><u></u></strong><br />
<strong><u>Step 3 :</u></strong><br />
<br />
Setup Oracle JDBC Settings.<br />
Provide database server host name, port and database sid.<br />
Click Next.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjsiOyUvrmayfkz402YPbQTGQYtUFcZoy2PdS0F6JJgcbMmwZtNOWEHprQ7B6iOWQGFlvxATj2ir80e7vFbmZ4IrOv4FMH2jr5OZWkO3lvuRp9tlCW3rQoTlt_xlww6il1vkHD1BoWSK3k/s1600/copy3.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjsiOyUvrmayfkz402YPbQTGQYtUFcZoy2PdS0F6JJgcbMmwZtNOWEHprQ7B6iOWQGFlvxATj2ir80e7vFbmZ4IrOv4FMH2jr5OZWkO3lvuRp9tlCW3rQoTlt_xlww6il1vkHD1BoWSK3k/s320/copy3.png" height="207" width="320" /></a></div>
<br />
<strong><u>Step 4 :</u></strong><br />
<br />
Provide user name and password to test the connection ( if needed ).<br />
Click Test database connection.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjwUfgnleFae_s0sRctTlZkD2OFPNnPLIGatgzrgXKaN-XfyNw4cN8QcG8dnSFnv2QOBcrK9786epTdqOEbplaC1hbN8R3kYrc3NLQzyAe3GLzywQLNsha9wmkoAp8hfpMSwM4srTKVPuw/s1600/copy4.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjwUfgnleFae_s0sRctTlZkD2OFPNnPLIGatgzrgXKaN-XfyNw4cN8QcG8dnSFnv2QOBcrK9786epTdqOEbplaC1hbN8R3kYrc3NLQzyAe3GLzywQLNsha9wmkoAp8hfpMSwM4srTKVPuw/s320/copy4.png" height="206" width="320" /></a></div>
<br />
<strong><u>Step 5 :</u></strong><br />
<br />
Connection report will pop up.<br />
Click Finish to create the connection.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjAsv6TZg9-J_dbAr3pigUpyqVOanfxFgOH9pY-iocCPQp3pePUcvh5fJY0g4kypmY4SVofCrj98-wMZUKLnkGzucJijIVjzqfu3TjiNmcW_YOlknri6-hm0xY_AGRbjpyyjVozm1OSP84/s1600/copy5.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjAsv6TZg9-J_dbAr3pigUpyqVOanfxFgOH9pY-iocCPQp3pePUcvh5fJY0g4kypmY4SVofCrj98-wMZUKLnkGzucJijIVjzqfu3TjiNmcW_YOlknri6-hm0xY_AGRbjpyyjVozm1OSP84/s320/copy5.png" height="91" width="320" /></a> </div>
Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com10tag:blogger.com,1999:blog-7135267265798115356.post-5155588003350049762013-12-25T18:39:00.000-08:002014-02-20T13:43:15.752-08:00Pentaho Common Errors : Driver class 'oracle.jdbc.driver.OracleDriver' could not be found<strong><u><span style="background-color: orange;">Error Message</span></u></strong><br />
<strong><u><span style="background-color: orange;"></span></u></strong><br />
<span style="font-size: x-small;">Error connecting to database [ORA_TEST_JDBC] : org.pentaho.di.core.exception.KettleDatabaseException: <br />Error occured while trying to connect to the database</span><br />
<span style="font-size: x-small;">Driver class 'oracle.jdbc.driver.OracleDriver' could not be found, make sure the 'Oracle' driver (jar file) is installed.<br />oracle.jdbc.driver.OracleDriver</span><br />
<a name='more'></a><br />
<span style="font-size: x-small;">org.pentaho.di.core.exception.KettleDatabaseException: <br />Error occured while trying to connect to the database</span><br />
<span style="font-size: x-small;">Driver class 'oracle.jdbc.driver.OracleDriver' could not be found, make sure the 'Oracle' driver (jar file) is installed.<br />oracle.jdbc.driver.OracleDriver</span><br />
<span style="font-size: x-small;">Caused by: java.lang.ClassNotFoundException: oracle.jdbc.driver.OracleDriver<br /> at java.net.URLClassLoader$1.run(Unknown Source)<br /> at java.net.URLClassLoader$1.run(Unknown Source)<br /> at java.security.AccessController.doPrivileged(Native Method)<br /> at java.net.URLClassLoader.findClass(Unknown Source)<br /> at java.lang.ClassLoader.loadClass(Unknown Source)<br /> at java.lang.ClassLoader.loadClass(Unknown Source)<br /> at org.pentaho.di.core.database.Database.connectUsingClass(Database.java:467)</span><br />
<br />
<strong><u><span style="background-color: lime;">Resolution</span></u></strong><br />
<strong><u><span style="background-color: orange;"></span></u></strong><br />
1) Download latest oracle jdbc driver.<br />
<br />
<a href="http://www.oracle.com/technetwork/database/enterprise-edition/jdbc-112010-090769.html">http://www.oracle.com/technetwork/database/enterprise-edition/jdbc-112010-090769.html</a><br />
ojdbc6 used here for demo purpose.<br />
<br />
2) Copy "ojdbc6.jar" to folder <br />
/pentaho/server/data-integration-server/tomcat/webapps/pentaho-di/WEB-INF/lib/<br />
<br />
3) Copy "ojdbc6.jar" to folder <br />
/pentaho/design-tools/data-integration/lib<br />
<br />
4) Restart Spoon.<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiyr7kiCEkv5N7ozKbL2Tu9DAyBASFpej5tPbL8hjn-WT4__jZp7oMEb6eYRkzzE_wfP9oLs5qNVQr9e0cXNaWQGdX-vtr-qCQ0j2IZcs7Wx1S7dPOKq3ZT_Wd-FKCiLvOvjC199t1MVas/s1600/bug.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiyr7kiCEkv5N7ozKbL2Tu9DAyBASFpej5tPbL8hjn-WT4__jZp7oMEb6eYRkzzE_wfP9oLs5qNVQr9e0cXNaWQGdX-vtr-qCQ0j2IZcs7Wx1S7dPOKq3ZT_Wd-FKCiLvOvjC199t1MVas/s1600/bug.jpg" height="1" width="1" /></a></div>
Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com20tag:blogger.com,1999:blog-7135267265798115356.post-19789431179697558952013-12-24T20:56:00.002-08:002014-02-20T14:05:49.111-08:00Pentaho Business Analytics Enterprise Edition 5.0.2 - Installation for Windows 64 bit<div class="separator" style="clear: both; text-align: left;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<strong><u>Step 1 :</u></strong> </div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhRq8qkvzHp27488on54Vgl8uehdbTgcCj_6mpTlpAVuYoPnqaC0ky1HGid8Mg9id0ysDNOOgu89ba48PDocqXNlcNp7vk7HO-wOvUk9rQrz_foFSkyZnr1ZWa4H5An5TDhVBH5bMh8hZs/s1600/config.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhRq8qkvzHp27488on54Vgl8uehdbTgcCj_6mpTlpAVuYoPnqaC0ky1HGid8Mg9id0ysDNOOgu89ba48PDocqXNlcNp7vk7HO-wOvUk9rQrz_foFSkyZnr1ZWa4H5An5TDhVBH5bMh8hZs/s1600/config.jpg" height="1" width="1" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
Download latest Pentaho Business Analytics version ( 5.0.2 ) from <a href="http://pentaho.com/download">http://pentaho.com/download</a></div>
<div class="separator" style="clear: both; text-align: center;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiKW0hK4tHR5loSNtwwO9RZK4tLSlKaqQ6rkYuelFSKW2aJCBj0N-9dUtsDzXbtCBosjz8va5cKfUOHnIp125fd9zPmIltVMHElUC3HUCINm2-h29HgOTVMLUL2CRLLA9pRaIKkvVauSUg/s1600/copy1.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiKW0hK4tHR5loSNtwwO9RZK4tLSlKaqQ6rkYuelFSKW2aJCBj0N-9dUtsDzXbtCBosjz8va5cKfUOHnIp125fd9zPmIltVMHElUC3HUCINm2-h29HgOTVMLUL2CRLLA9pRaIKkvVauSUg/s320/copy1.png" height="181" width="320" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
</div>
<div class="separator" style="clear: both; text-align: left;">
<strong><u>Step 2 :</u></strong> </div>
<div class="separator" style="clear: both; text-align: left;">
</div>
<div class="separator" style="clear: both; text-align: left;">
Save and execute file pentaho-business-analytics-5.0.2-x64.exe.</div>
Installation wizard will pop up. Click Next.<br />
<a name='more'></a><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEis5ugWFDF0H0bC0nDqpH8B8-4ruEzxNk2wx7uZNp3Cwu9NBd-LAmFtnpHQZzA_omg6Nv0tsIwp4pgkLT-JlOHlXkzL42kJN5bUPaycP7vRcWySSd3pVsYrAEQtei7gXup_24ptJ_VA4EQ/s1600/copy2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEis5ugWFDF0H0bC0nDqpH8B8-4ruEzxNk2wx7uZNp3Cwu9NBd-LAmFtnpHQZzA_omg6Nv0tsIwp4pgkLT-JlOHlXkzL42kJN5bUPaycP7vRcWySSd3pVsYrAEQtei7gXup_24ptJ_VA4EQ/s320/copy2.png" height="232" width="320" /></a></div>
<br />
<strong><u>Step 3 :</u></strong> <br />
<br />
Accept license agreements and Click Next.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgB3IsStY0WRJ6Riw-LL7skhacJo6KbXg1FkY1PCeZtDwlaIyl_kqAb8U9LfDZeMmar4SfzGid4qUqPDYITztsNL25X8afkSRsQph0ZH0vtrjxtIN08ahEH7QYbLDeD804plLeuHWa7LIk/s1600/copy3.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgB3IsStY0WRJ6Riw-LL7skhacJo6KbXg1FkY1PCeZtDwlaIyl_kqAb8U9LfDZeMmar4SfzGid4qUqPDYITztsNL25X8afkSRsQph0ZH0vtrjxtIN08ahEH7QYbLDeD804plLeuHWa7LIk/s320/copy3.png" height="232" width="320" /></a></div>
<br />
<strong><u>Step 4 : </u></strong><br />
<strong><u></u></strong><br />
Choose installation Setup Type. Click Next.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiguYEHCIEKmx6K20Rld6lrh7wP65YNGB261H4oXskZhzJc8m8uphw31uCtjxL4s_xzjC_eEJNPo4YlqE4NknIEgc7qrg-UaUlu1u57oMbMYR4mxe0mZ4twpIVRiIhxi9DAlgos19uaKsA/s1600/copy4.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiguYEHCIEKmx6K20Rld6lrh7wP65YNGB261H4oXskZhzJc8m8uphw31uCtjxL4s_xzjC_eEJNPo4YlqE4NknIEgc7qrg-UaUlu1u57oMbMYR4mxe0mZ4twpIVRiIhxi9DAlgos19uaKsA/s320/copy4.png" height="232" width="320" /></a></div>
<br />
<strong><u>Step 5 : </u></strong><br />
<strong><u></u></strong><br />
Choose installation location. Click Next.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEim2jEIMjGGcVEGn9c5sX7gyto7iObO1LztZ4eMxSLR7q2h5n_vvCbmcz9WnTZNO4qEZOEZLDfy26b9WxgvtyLsCHZcRoYKIiSImjYaRGybnt3e1DzFwdFQiwNUhGKCcRFMfnYJWQ17cnY/s1600/copy5.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEim2jEIMjGGcVEGn9c5sX7gyto7iObO1LztZ4eMxSLR7q2h5n_vvCbmcz9WnTZNO4qEZOEZLDfy26b9WxgvtyLsCHZcRoYKIiSImjYaRGybnt3e1DzFwdFQiwNUhGKCcRFMfnYJWQ17cnY/s320/copy5.png" height="232" width="320" /></a></div>
<br />
<strong><u>Step 6 :</u></strong> <br />
<br />
Set up password for database postgres.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhH5YFov03QS1dqLFIgWd3QMkkRPAWKZ1-ZO21h0dgokHekrOWxkW7HAPq0MoRIxTXWKzHErjAArhMFDCeyCPCU6MIYyqzwG9HbBUk-dmmU3saCMzTz7xXh_Z3znuqbKxtTD-T_5SWzUrQ/s1600/copy6.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhH5YFov03QS1dqLFIgWd3QMkkRPAWKZ1-ZO21h0dgokHekrOWxkW7HAPq0MoRIxTXWKzHErjAArhMFDCeyCPCU6MIYyqzwG9HbBUk-dmmU3saCMzTz7xXh_Z3znuqbKxtTD-T_5SWzUrQ/s320/copy6.png" height="232" width="320" /></a></div>
<br />
<strong><u>Step 7 :</u></strong><br />
<br />
Set up will begin installing Pentaho Business Analytics.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhKk_5Sp9nOb7ugZenw6Oq9xrofB4NMq8PZLzoQnwga9UfaC9WuZW-9cvJkzuvBYMO5FHLUQmo0L530WLpJsgwndcbHOndggwySgU_htATHLHSG8uz-wX6OpI5i_sq2_9WGfLQRMHPxvbs/s1600/copy7.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhKk_5Sp9nOb7ugZenw6Oq9xrofB4NMq8PZLzoQnwga9UfaC9WuZW-9cvJkzuvBYMO5FHLUQmo0L530WLpJsgwndcbHOndggwySgU_htATHLHSG8uz-wX6OpI5i_sq2_9WGfLQRMHPxvbs/s320/copy7.png" height="233" width="320" /></a></div>
<br />
<strong><u>Step 8 :</u></strong> <br />
<br />
Evaluation licenses will be auto generated and installed for trial versions.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgtqyoCgPZgpXBRujQKn7vsgX8B_nkcon9OSJP0pHLqq_JYHyf9TBdqcgunpAd6f08oOGhG6og-LwGBesaafgdYMEatgi6YKaTou1LjUJcwVeIhVqyRn38iHc4m3ixFAt6FNR8llQZZJ0U/s1600/copy9.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgtqyoCgPZgpXBRujQKn7vsgX8B_nkcon9OSJP0pHLqq_JYHyf9TBdqcgunpAd6f08oOGhG6og-LwGBesaafgdYMEatgi6YKaTou1LjUJcwVeIhVqyRn38iHc4m3ixFAt6FNR8llQZZJ0U/s320/copy9.png" height="231" width="320" /></a></div>
<br />
<br />
<strong><u>Step 9 :</u></strong> <br />
<br />
Installing Pentaho Business Analytics Enterprise edition.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhpSPYRc0J_z4A7a9N0DtdbW6zr4GQinDvQbzeTRg-q4V9duOdtsZ2YESM5MfKLnZOExOsD5qXlYJGgBvCRcciQdmbrifIwD9J7O25Xp3aU-KNuZnr_ceuB6gm7xCzNrW3EYmNDzrz8gbk/s1600/copy10.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhpSPYRc0J_z4A7a9N0DtdbW6zr4GQinDvQbzeTRg-q4V9duOdtsZ2YESM5MfKLnZOExOsD5qXlYJGgBvCRcciQdmbrifIwD9J7O25Xp3aU-KNuZnr_ceuB6gm7xCzNrW3EYmNDzrz8gbk/s320/copy10.png" height="232" width="320" /></a></div>
<br />
<strong><u>Step 10 :</u></strong> <br />
<br />
Installation Complete.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgLhbzoEzoPyfCZ1TaShGR4dFNJqChdhtZORcgZU2YpiSLlgu9WQovJ5DPVyQiKdlrN8GoTQeg7cYQuad7W15xHE5Rv6cSscZ65I8nwoHhemaGUj54oDK2PpfztEnTAdm9tapSxPrGgvRU/s1600/copy11.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgLhbzoEzoPyfCZ1TaShGR4dFNJqChdhtZORcgZU2YpiSLlgu9WQovJ5DPVyQiKdlrN8GoTQeg7cYQuad7W15xHE5Rv6cSscZ65I8nwoHhemaGUj54oDK2PpfztEnTAdm9tapSxPrGgvRU/s320/copy11.png" height="232" width="320" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
</div>
<strong><u>Step 11 :</u></strong> <br />
<br />
Pentaho Business Analytics Installation Summary.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgGkQqoAn0MNeCmYIYJlR70_pl9JXQlLFd9-z8FW2QaVmXh4oacNbkf2XrvnLzDplqgBJNlHhVLIiYNt10D8twMFzo6YY7dRKRKUglI2WTQHzpTbLBzSRgblwQFzm5eCN9ZKFyeuGoYfgs/s1600/copy12.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgGkQqoAn0MNeCmYIYJlR70_pl9JXQlLFd9-z8FW2QaVmXh4oacNbkf2XrvnLzDplqgBJNlHhVLIiYNt10D8twMFzo6YY7dRKRKUglI2WTQHzpTbLBzSRgblwQFzm5eCN9ZKFyeuGoYfgs/s320/copy12.png" height="320" width="239" /></a></div>
<br />Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com1tag:blogger.com,1999:blog-7135267265798115356.post-72694436508113383302013-12-15T19:09:00.001-08:002014-02-20T13:46:32.701-08:00Pentaho Data Integration - Configure DI Server for Windows<b><u>Step 1 :</u></b><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjPnYDBuOof8mQBDDTEytDAR1ycXaendpSgORbZ5dCMwf_U1MlyHAALFEJyIlZuJWzfxvK1aoYbup4jDMoJy6XgNGWk2chcskWcSMMoi5mDEePK3mepSE37NPBLjyiRi3oowWuz74K80vc/s1600/pent11.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjPnYDBuOof8mQBDDTEytDAR1ycXaendpSgORbZ5dCMwf_U1MlyHAALFEJyIlZuJWzfxvK1aoYbup4jDMoJy6XgNGWk2chcskWcSMMoi5mDEePK3mepSE37NPBLjyiRi3oowWuz74K80vc/s1600/pent11.jpg" height="1" width="1" /></a></div>
<b><u><br /></u><br /></b>
Go To "Start > Pentaho Enterprise Edition > Server Management"<br />
Start DI and Tomcat servers using icon "Start Data Integration Server"<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgeyIKjHJqXxX6zk9a77et8o-_AqVQyTFnSZyj04gb8PFiJMB9e3abZpYmvxvHYTx_wiP2UNHvgI8PXgiUsNTE7nVu7FkF9i_5ijoR9scTzKRuBD3eXxpS4GXvJp6gNqL-OeqmYxnE7fu4/s1600/copy1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgeyIKjHJqXxX6zk9a77et8o-_AqVQyTFnSZyj04gb8PFiJMB9e3abZpYmvxvHYTx_wiP2UNHvgI8PXgiUsNTE7nVu7FkF9i_5ijoR9scTzKRuBD3eXxpS4GXvJp6gNqL-OeqmYxnE7fu4/s400/copy1.jpg" height="230" width="400" /></a></div>
<br />
<a name='more'></a><br />
<b><u>Step 2 :</u></b><br />
<b><u><br /></u></b>
Access DI repository using Spoon design tool.<br />
Spoon is a desktop application that can be used primarily as a graphical interface and editor for transformations and jobs.<br />
<br />
Go To "Start > Pentaho Enterprise Edition > Design Tools"<br />
Click on "Data Integration" to start spoon.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi7qs3KAajMiuDeN2oyKR6I8LMBgKOJOPk3ewwB4j2hp0AJkmIkKZmZXdCF1U1jy4Zwh46nKWdUeXe2LFPFfFeRWssrUmqsWBOTcdx8Vk3H8s82e6odVqHduGgj3PAhCUogDRQtzI1txw4/s1600/copy2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi7qs3KAajMiuDeN2oyKR6I8LMBgKOJOPk3ewwB4j2hp0AJkmIkKZmZXdCF1U1jy4Zwh46nKWdUeXe2LFPFfFeRWssrUmqsWBOTcdx8Vk3H8s82e6odVqHduGgj3PAhCUogDRQtzI1txw4/s400/copy2.jpg" height="225" width="400" /></a></div>
<br />
<b><u>Step 3 :</u></b><br />
<b><u><br /></u></b>
Repository Connection dialogue box will pop up.<br />
Click on "+" button to add new repository connection.<br />
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh7PUxZE0q41Wlptpv-GnIx_I86EMVdtVz1aTny0UISfDxqffsieafqTc6jqTTJujLNHse9yokjdd4UYRtNJZ8ZzkWgpOI-yVClrlT9lApfzarrEzqoe3MWStB8kDq5HVRXR3iMCUV8qU4/s1600/copy4.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh7PUxZE0q41Wlptpv-GnIx_I86EMVdtVz1aTny0UISfDxqffsieafqTc6jqTTJujLNHse9yokjdd4UYRtNJZ8ZzkWgpOI-yVClrlT9lApfzarrEzqoe3MWStB8kDq5HVRXR3iMCUV8qU4/s400/copy4.jpg" height="400" width="337" /></a></div>
<br />
<b><u>Step 4 :</u></b><br />
<b><u><br /></u></b>
Repository type dialogue box will pop up.<br />
Select "DI Repository" and then click OK.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhQJAuBpJXsGcYFrxVkOS5qxQJ7PwOgjq8SGs1s7kk6tkwM1MZAUbERPAIMU7mywS4bUoesXQPEbIUmKKlo8jAwKw8rZRTJi1sRMSd5QZyOf2d80PrxPb_F1GjI-QitTPjmwRm30Xv5SLs/s1600/copy5.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhQJAuBpJXsGcYFrxVkOS5qxQJ7PwOgjq8SGs1s7kk6tkwM1MZAUbERPAIMU7mywS4bUoesXQPEbIUmKKlo8jAwKw8rZRTJi1sRMSd5QZyOf2d80PrxPb_F1GjI-QitTPjmwRm30Xv5SLs/s400/copy5.jpg" height="133" width="400" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<b><u>Step 5 :</u></b><br />
<b><u><br /></u></b>
Enter Repository connection information.<br />
Enter suitable names for ID and Name.<br />
<br />
Default URL : http://localhost:9080/pentaho-di<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhfpTNheIOW4OVgNJSABOM2wWRu6g_cgDTWWrQAeIkdVi4WsQnyJn7h954GbhPzBO3V2I57BBkRoUwEXXViDHy5stjHnGK-UfMx9TxY03EBe9145Rodpcshst4wkevsdEZjfZI77KNsrsY/s1600/copy6.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhfpTNheIOW4OVgNJSABOM2wWRu6g_cgDTWWrQAeIkdVi4WsQnyJn7h954GbhPzBO3V2I57BBkRoUwEXXViDHy5stjHnGK-UfMx9TxY03EBe9145Rodpcshst4wkevsdEZjfZI77KNsrsY/s400/copy6.jpg" height="273" width="400" /></a></div>
<br />
<b><u>Step 6 :</u></b><br />
<b><u><br /></u></b>Click on Test to verify whether the URL is correct.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh1gP_gp61lYgoGuIuerqoBvHNbLEQWn4XOf6gngMKIR9y6dq8AOFbFJc38uY4HIH_tWbcu9c89wORgR5ST0xQqfLvQ25UstNC5cxfBVVErkZAMS9xonkrpiUdWvl1nr4ddoRQJhv3OPLk/s1600/copy7.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh1gP_gp61lYgoGuIuerqoBvHNbLEQWn4XOf6gngMKIR9y6dq8AOFbFJc38uY4HIH_tWbcu9c89wORgR5ST0xQqfLvQ25UstNC5cxfBVVErkZAMS9xonkrpiUdWvl1nr4ddoRQJhv3OPLk/s400/copy7.jpg" height="290" width="400" /></a></div>
<br />
<b><u>Step 7 :</u></b><br />
<br />
Provide user name and password.<br />
Click OK to connect to the repository.<br />
<br />
Default User Name : admin<br />
Default Password : password<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiGnqf3pH78bAO29a2-mhPqC7McRtclKLFeV7OCqa8HFVJttrEe-zcPtQhZWc309NuVakshI_zkPYDdd6Urlmxvk8PZfixL_XVE7-lWaHsGUh1u_PL7UZFkTWpQlaoVmdE0wXtr_HpqCWc/s1600/copy8.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiGnqf3pH78bAO29a2-mhPqC7McRtclKLFeV7OCqa8HFVJttrEe-zcPtQhZWc309NuVakshI_zkPYDdd6Urlmxvk8PZfixL_XVE7-lWaHsGUh1u_PL7UZFkTWpQlaoVmdE0wXtr_HpqCWc/s400/copy8.jpg" height="400" width="336" /></a></div>
<br />
<b><u>Step 8 :</u></b><br />
<b><u><br /></u></b>Logging into the repository.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgJo95tV8B8VL_9CcUiA0gLp5kWfkaXKUbziwBHZxSIyfiy4KestthwEVs0mgmEDNkUSwuDNMIy2QUlpTAVVZlEJo0K9ymNi-gwpLCpF1ooUy8IElGKmf5Fub2aMyXS8LfXuBEMw_QqriE/s1600/copy9.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgJo95tV8B8VL_9CcUiA0gLp5kWfkaXKUbziwBHZxSIyfiy4KestthwEVs0mgmEDNkUSwuDNMIy2QUlpTAVVZlEJo0K9ymNi-gwpLCpF1ooUy8IElGKmf5Fub2aMyXS8LfXuBEMw_QqriE/s400/copy9.jpg" height="194" width="400" /></a></div>
<br />
<b><u>Step 9 :</u></b><br />
<b><u><br /></u></b>DI server repository is connected.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiqyF_1tyeXugHgEPSZOQ4krEZ9M-CA9VXv2xN6k5IyzcqJnVo0McnXioDdC_kCO96ffjnmwnoXr4KAEyuHbZSP0aU8IaOqlvg5_iXATb7N3ZfE9taHxvEObX2JHtUygSoSCkVqht6b0I4/s1600/copy10.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiqyF_1tyeXugHgEPSZOQ4krEZ9M-CA9VXv2xN6k5IyzcqJnVo0McnXioDdC_kCO96ffjnmwnoXr4KAEyuHbZSP0aU8IaOqlvg5_iXATb7N3ZfE9taHxvEObX2JHtUygSoSCkVqht6b0I4/s400/copy10.jpg" height="140" width="400" /></a></div>
<br />Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com3tag:blogger.com,1999:blog-7135267265798115356.post-47431084820583862372013-12-15T09:41:00.001-08:002014-02-20T13:47:21.574-08:00Pentaho Data Integration - PDI 5.0.2 Installation for Windows 64 bit<b><u>Step 1 : </u></b><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhcmptx8ma5YZ-8_Crk9nczgm4gFHe3juk9PIEpF_StDkLHl1X7DC3ncGZeEGArqOIh7ivuhaG9gnuTtGrLAtsTmUIZ7hKV-Y0lTjahBc2_alQhjUHr-4KAP01MzFlipmSoeoprM6AIUj0/s1600/install.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhcmptx8ma5YZ-8_Crk9nczgm4gFHe3juk9PIEpF_StDkLHl1X7DC3ncGZeEGArqOIh7ivuhaG9gnuTtGrLAtsTmUIZ7hKV-Y0lTjahBc2_alQhjUHr-4KAP01MzFlipmSoeoprM6AIUj0/s1600/install.jpg" height="1" width="1" /></a></div>
<br />
Download latest PDI version ( 5.0.2 ) from http://pentaho.com/download<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjmwj1eUKQtXE8WOqK6-vQxhBShtlxxrEuRQqJcNuCXBLHZXZbljF3d_QAOyIfayCvAzwiUCiOAhJRiqc-H2F-clfAMP3O4mLUsqQayHOe_H9P0CK6BefkS3279fXSzLWF0WlN0PapLqnM/s1600/copy1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjmwj1eUKQtXE8WOqK6-vQxhBShtlxxrEuRQqJcNuCXBLHZXZbljF3d_QAOyIfayCvAzwiUCiOAhJRiqc-H2F-clfAMP3O4mLUsqQayHOe_H9P0CK6BefkS3279fXSzLWF0WlN0PapLqnM/s400/copy1.jpg" height="246" width="400" /></a></div>
<a name='more'></a><br />
<b><u>Step 2 : </u></b><br />
<br />
Save and execute file pdi-5.0.2-x64.exe.<br />
Installation wizard will pop up. Click Forward.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjq_7UBZvmjnaiQL-9I4ideEfOjgYK0bqBYLXQnObd4Qg4uOO58JyvpfqVhSP7BlC4BMQ3qKBfxcTouzQZtV_flw3l4ojcgU2AYPF64NHxY36YklcfDtOwvFx1kFGp5l_sHbpoNpoS1V3A/s1600/copy2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjq_7UBZvmjnaiQL-9I4ideEfOjgYK0bqBYLXQnObd4Qg4uOO58JyvpfqVhSP7BlC4BMQ3qKBfxcTouzQZtV_flw3l4ojcgU2AYPF64NHxY36YklcfDtOwvFx1kFGp5l_sHbpoNpoS1V3A/s400/copy2.jpg" height="290" width="400" /></a></div>
<br />
<b><u>Step 3 : </u></b><br />
<br />
Accept license agreements. Click Forward.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhbx_Inm9k2lGi4Vf4Oo6L_w7wpqqDNoDTGEeDHbMJbk4j4xo_Goaf1FgCKi3cjKBwofEKbbvEACqNxJSZVBrWgVg9Vbou2VevbH8MmSK9EejkNO8g_2Yd7EIdjcku9bqxrFNWm-s8ehzY/s1600/copy3.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhbx_Inm9k2lGi4Vf4Oo6L_w7wpqqDNoDTGEeDHbMJbk4j4xo_Goaf1FgCKi3cjKBwofEKbbvEACqNxJSZVBrWgVg9Vbou2VevbH8MmSK9EejkNO8g_2Yd7EIdjcku9bqxrFNWm-s8ehzY/s400/copy3.jpg" height="291" width="400" /></a></div>
<br />
<b><u>Step 4 : </u></b><br />
<br />
Choose installation location. Click Forward.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj0LP76N503bYMgNOaQ8BAxLAPDvB3aSp-cgTxJeuRzwpjWbQuUb0uj4JcFVHE6p0_EtQPk61e7wShpzZr1uK1CuGbISb8miFrMOaq3NzI6hGOqiXB07t5hWDfzTQq_cgalCZ7qTIW1liM/s1600/copy4.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEj0LP76N503bYMgNOaQ8BAxLAPDvB3aSp-cgTxJeuRzwpjWbQuUb0uj4JcFVHE6p0_EtQPk61e7wShpzZr1uK1CuGbISb8miFrMOaq3NzI6hGOqiXB07t5hWDfzTQq_cgalCZ7qTIW1liM/s400/copy4.jpg" height="291" width="400" /></a></div>
<br />
<b><u>Step 5 : </u></b><br />
<br />
Set up password for PostgreSQL.<br />
<div>
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjrMaYpxWyntfk-iZy-jfD6W1xY3RxhQcTUaP_mNe7u_sIgf0vOtXzc-n1AROnfEdD0dWIURKB47kFUuQN-HEz0UPWECZSdRJ_QQSi9oYBsGIbz_tFXa8BIPf87v_8lyd_LnJCuhgRnJbo/s1600/copy5.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjrMaYpxWyntfk-iZy-jfD6W1xY3RxhQcTUaP_mNe7u_sIgf0vOtXzc-n1AROnfEdD0dWIURKB47kFUuQN-HEz0UPWECZSdRJ_QQSi9oYBsGIbz_tFXa8BIPf87v_8lyd_LnJCuhgRnJbo/s400/copy5.jpg" height="291" width="400" /></a></div>
<br />
<b><u>Step 6 : </u></b><br />
<br />
Set up will begin installing Pentaho applications.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEickZgrp6BghONleA4v7nYrwhsl7RlcLmLjOTXa1MyCPWB4m8mWyTelahKjTJYmEOZOCUi88wG_0Xp6M-K2-A3XbeLt58-5Ho2G3J1iOQwM6kNxEkMMBDTLlB43fVUjnMmg5CY_VcFSLzQ/s1600/copy6.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEickZgrp6BghONleA4v7nYrwhsl7RlcLmLjOTXa1MyCPWB4m8mWyTelahKjTJYmEOZOCUi88wG_0Xp6M-K2-A3XbeLt58-5Ho2G3J1iOQwM6kNxEkMMBDTLlB43fVUjnMmg5CY_VcFSLzQ/s400/copy6.jpg" height="291" width="400" /></a></div>
<br />
<b><u>Step 7 : </u></b><br />
<br />
Evaluation licenses will be auto generated and installed for trial versions.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgIVg2ULXwdcQyaThLAZLQS39_E6xpTgVnDHxQERDSuGfJboNfhkZQAIPGWLcVXz7wHbI6xccPOZQOk76Xzy9066qiJaR3ktC45nyVcPJBFQjnyYy8e6W2px9fhm2XibygKpm14-2NUoCM/s1600/copy7.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgIVg2ULXwdcQyaThLAZLQS39_E6xpTgVnDHxQERDSuGfJboNfhkZQAIPGWLcVXz7wHbI6xccPOZQOk76Xzy9066qiJaR3ktC45nyVcPJBFQjnyYy8e6W2px9fhm2XibygKpm14-2NUoCM/s320/copy7.jpg" height="233" width="320" /></a></div>
<br />
<b><u>Step 8 : </u></b><br />
<br />
Installation in progress.<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh4DS9n7hTsxYi1I7OWbzZu6_RDuwsoZfGZqlZJq-zP5a2HvaV94SLwXJwnJJZ5mVI1x2AYGpQzAY9HwzC4CAxk1GU-PlJ13v39dWjTk8a7b24Kc1cExcAHz56qrtDvmzuABqiz9e-76iA/s1600/copy8.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh4DS9n7hTsxYi1I7OWbzZu6_RDuwsoZfGZqlZJq-zP5a2HvaV94SLwXJwnJJZ5mVI1x2AYGpQzAY9HwzC4CAxk1GU-PlJ13v39dWjTk8a7b24Kc1cExcAHz56qrtDvmzuABqiz9e-76iA/s400/copy8.jpg" height="291" width="400" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<b><u>Step 9 : </u></b><br />
<br />
Initializing database.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjTVlX3uYiNSwx6TUl9CIj9kBdYaC8c3qTySOOzYU5tdg40ExckAfU1VoDUCSZWNopGKJKmKXNrVhDzAna5E0Q1bPbZYMX7OeRLrZZdCP5KlBHBioloooab0xLjxrfcKcrhlM3m4_ndQ8A/s1600/copy9.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjTVlX3uYiNSwx6TUl9CIj9kBdYaC8c3qTySOOzYU5tdg40ExckAfU1VoDUCSZWNopGKJKmKXNrVhDzAna5E0Q1bPbZYMX7OeRLrZZdCP5KlBHBioloooab0xLjxrfcKcrhlM3m4_ndQ8A/s400/copy9.jpg" height="281" width="400" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<div class="separator" style="clear: both;">
<b><u>Step 10 :</u></b> </div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Installation complete.</div>
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjiaXjBodTiftMGT9avIRt6pqNZ7geW5OGbgYmyss8EQnOvWm5VGsfwdAxhjiT-6_1SKN1BSQn1gTUKWNo9DxKmQGox2QpRVFYl6rOf1Svp-Doe82BZ7HXeTnUe8gk4U44o2siJu7Z0xj8/s1600/copy10.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjiaXjBodTiftMGT9avIRt6pqNZ7geW5OGbgYmyss8EQnOvWm5VGsfwdAxhjiT-6_1SKN1BSQn1gTUKWNo9DxKmQGox2QpRVFYl6rOf1Svp-Doe82BZ7HXeTnUe8gk4U44o2siJu7Z0xj8/s400/copy10.jpg" height="288" width="400" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<div class="separator" style="clear: both;">
<b><u>Step 11 : </u></b></div>
<div class="separator" style="clear: both;">
<br /></div>
<div class="separator" style="clear: both;">
Post Installation - Enterprise Console and Spoon can be launched for configuration.</div>
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjewrscDBXtPLLs4PzrUBTMIzrgcsZ53LrI8PI2hgJMYObCbX1AgKk3N4MykcJpE_eivNqNM8n1EgnaLSrxIMNUrZhnzXm_qweamT4WJH3OPZbKJ6iyLIwSH09Z3gu_2CCrQfTkkcEkg6Y/s1600/copy11.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjewrscDBXtPLLs4PzrUBTMIzrgcsZ53LrI8PI2hgJMYObCbX1AgKk3N4MykcJpE_eivNqNM8n1EgnaLSrxIMNUrZhnzXm_qweamT4WJH3OPZbKJ6iyLIwSH09Z3gu_2CCrQfTkkcEkg6Y/s400/copy11.jpg" height="277" width="400" /></a></div>
<br />Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com1tag:blogger.com,1999:blog-7135267265798115356.post-2132796398765990422013-11-10T10:25:00.001-08:002014-02-20T13:48:29.229-08:00Pentaho Data Integration - PDI 5.0.1 Installation for Linux<b><u>Step 1 : </u></b><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhcmptx8ma5YZ-8_Crk9nczgm4gFHe3juk9PIEpF_StDkLHl1X7DC3ncGZeEGArqOIh7ivuhaG9gnuTtGrLAtsTmUIZ7hKV-Y0lTjahBc2_alQhjUHr-4KAP01MzFlipmSoeoprM6AIUj0/s1600/install.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhcmptx8ma5YZ-8_Crk9nczgm4gFHe3juk9PIEpF_StDkLHl1X7DC3ncGZeEGArqOIh7ivuhaG9gnuTtGrLAtsTmUIZ7hKV-Y0lTjahBc2_alQhjUHr-4KAP01MzFlipmSoeoprM6AIUj0/s1600/install.jpg" height="1" width="1" /></a></div>
<br />
Download latest PDI version ( 5.0.1 ) from http://pentaho.com/download<br />
<br />
<b><u>Step 2 : </u></b><br />
<br />
Save the bin file to Downloads directory.<br />
Execute bin file pdi-5.0.1-x64.bin.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgz8MPB9N1CttpfqqA8i4fju-dEQmca0qDJubL0uWsRdt-jvkWWtlmRKQdlZQKxKv6hLyXeuSHSGmdKbiM21hQ8EFWfp_TjufM2l-8634Xxn_C0w0VP7moky9gXV_kqs1KHUcCkEqzu9Jc/s1600/copy1.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgz8MPB9N1CttpfqqA8i4fju-dEQmca0qDJubL0uWsRdt-jvkWWtlmRKQdlZQKxKv6hLyXeuSHSGmdKbiM21hQ8EFWfp_TjufM2l-8634Xxn_C0w0VP7moky9gXV_kqs1KHUcCkEqzu9Jc/s1600/copy1.jpg" /></a></div>
<br />
<a name='more'></a><br />
<b><u>Step 3 : </u></b><br />
<br />
Installation wizard will pop up. Click Forward.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiaEKklSwgqK6AqX88yVqvAgrQCBXvg30FTHnW3jV-cv8JDNeWrxeG0uQ4q4vQGBFWy7aTl_TQl5ZXKd_gQ2HqvRuiwTkfoynjKIrfj7VohovG3rR2sSXbHL9uw3yAMNlWZgLHpx9Q5PtI/s1600/copy2.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiaEKklSwgqK6AqX88yVqvAgrQCBXvg30FTHnW3jV-cv8JDNeWrxeG0uQ4q4vQGBFWy7aTl_TQl5ZXKd_gQ2HqvRuiwTkfoynjKIrfj7VohovG3rR2sSXbHL9uw3yAMNlWZgLHpx9Q5PtI/s320/copy2.jpg" height="291" width="320" /></a></div>
<br />
<br />
<b><u>Step 4 : </u></b><br />
<br />
Accept license agreements. Click Forward.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgAZBJXCdMTpMUMF1tqf2hDveeCfVM1hRxKllePcVWx6PkTME8yZhgl8j-l8ve_ZTeOVGkLvmCaonv-nIqbFBzrQMeMa0NegwXaToozOUjV4jerehy-LtFbumL4vNlJ8TZZzDnY6THYy_U/s1600/copy3.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgAZBJXCdMTpMUMF1tqf2hDveeCfVM1hRxKllePcVWx6PkTME8yZhgl8j-l8ve_ZTeOVGkLvmCaonv-nIqbFBzrQMeMa0NegwXaToozOUjV4jerehy-LtFbumL4vNlJ8TZZzDnY6THYy_U/s320/copy3.jpg" height="242" width="320" /></a></div>
<br />
<br />
<b><u>Step 5 : </u></b><br />
<br />
Choose installation location. Click Forward.<br />
<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhJcrO_LKYyUc0fpVZhQ0BNOmsL-xSmjO_viG2eOqiRLV_f651uU0zVE6v77ZXoOu9-9_kI46AaIkfxOy0IkbiBd5GwCdQ8f0b_0_jjNMubN1gHBprkBVU3dSiXhit8zyYCLuAN0-aR3o0/s1600/copy4.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhJcrO_LKYyUc0fpVZhQ0BNOmsL-xSmjO_viG2eOqiRLV_f651uU0zVE6v77ZXoOu9-9_kI46AaIkfxOy0IkbiBd5GwCdQ8f0b_0_jjNMubN1gHBprkBVU3dSiXhit8zyYCLuAN0-aR3o0/s320/copy4.jpg" height="237" width="320" /></a></div>
<br />
<br />
<b><u>Step 6 : </u></b><br />
<br />
Set up password for PostgreSQL.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgLGPbiV9qI9yEpg77PRAWP2WfJ0VnXxjIVgEc3gEWbWMw7Yk-jo-2WW3HQOis5xPRLg6RqslEaT6dXIgntu2WfvsGKgNZUILxhL4WBOnOpoVfLj4SvI4z6T1ez7Ij37DGg9xfz4euRJcM/s1600/copy5.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgLGPbiV9qI9yEpg77PRAWP2WfJ0VnXxjIVgEc3gEWbWMw7Yk-jo-2WW3HQOis5xPRLg6RqslEaT6dXIgntu2WfvsGKgNZUILxhL4WBOnOpoVfLj4SvI4z6T1ez7Ij37DGg9xfz4euRJcM/s320/copy5.jpg" height="239" width="320" /></a></div>
<br />
<br />
<b><u>Step 7 : </u></b><br />
<br />
Set up will begin installing Pentaho applications.<br />
Evaluation licenses will be auto generated and installed for trial versions.<br />
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhqznXsgYDQrFQF4L2sQLQ3qQDPlyXp2yePmTMoWdnCjTThsq8GzwTl8K81TUp5Y2GwC-TeoH00kklaZky3t8dpoLfZ0xoFX8VPOaW096Sf5lYJaDjy93vBHB3lrgFlGTQ2radiHyRvIdk/s1600/copy7.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhqznXsgYDQrFQF4L2sQLQ3qQDPlyXp2yePmTMoWdnCjTThsq8GzwTl8K81TUp5Y2GwC-TeoH00kklaZky3t8dpoLfZ0xoFX8VPOaW096Sf5lYJaDjy93vBHB3lrgFlGTQ2radiHyRvIdk/s320/copy7.jpg" height="242" width="320" /></a></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<div class="separator" style="clear: both; text-align: left;">
<br /></div>
<b><u>Step 8 : </u></b><br />
<br />
Installation in progress.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjyfE69tZn0fyWRnnGi4NaOnqBbh0lvHBAa6QfdPpB2Q6FgSObis8rZxH_5Ou7Xfh4mYODhhv9SrymqWpGKDowbPxU63Ol7iz8hRjSaE-_8Bg-D4CpJOuxqRmTo78sMogr1ilA2LoHlKj4/s1600/copy8.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjyfE69tZn0fyWRnnGi4NaOnqBbh0lvHBAa6QfdPpB2Q6FgSObis8rZxH_5Ou7Xfh4mYODhhv9SrymqWpGKDowbPxU63Ol7iz8hRjSaE-_8Bg-D4CpJOuxqRmTo78sMogr1ilA2LoHlKj4/s320/copy8.jpg" height="237" width="320" /></a></div>
<br />
<br />
<b><u>Step 9 : </u></b><br />
<br />
Installation complete.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjw65aNYjgyO3bpaERHou4dXsQ746qY8NyNJKjmg89PndavU8rV9e3hIppA6m4YIS36npur83a-vShsWcFuXaoptsDwkDcXaIVWZ0cIFJGOfe3kMqGKrmwpZzvfEL-VFv9cyyXmeKXyfwo/s1600/copy9.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjw65aNYjgyO3bpaERHou4dXsQ746qY8NyNJKjmg89PndavU8rV9e3hIppA6m4YIS36npur83a-vShsWcFuXaoptsDwkDcXaIVWZ0cIFJGOfe3kMqGKrmwpZzvfEL-VFv9cyyXmeKXyfwo/s320/copy9.jpg" height="240" width="320" /></a></div>
<br />
<br />
<b><u>Step 10 : </u></b><br />
<br />
Post Installation - Enterprise Console and Spoon can be launched for configuration.<br />
<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhG_ElgDUr9abz4Xg1CrEqQHp7YiKDgoJ0aLGxVO9uRQf7_ObabrkB4oyB5fpwDun_6Vsvldw_UePKeyGAkIbG9tYTjro_9UMOdSLuKtE99ide7ylR1oLMFpYhlD_mDHTzEH5eFh5FbFSM/s1600/copy10.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhG_ElgDUr9abz4Xg1CrEqQHp7YiKDgoJ0aLGxVO9uRQf7_ObabrkB4oyB5fpwDun_6Vsvldw_UePKeyGAkIbG9tYTjro_9UMOdSLuKtE99ide7ylR1oLMFpYhlD_mDHTzEH5eFh5FbFSM/s320/copy10.jpg" height="222" width="320" /></a></div>
<br />
<br />Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com0tag:blogger.com,1999:blog-7135267265798115356.post-1660306022518088052013-10-04T10:57:00.003-07:002014-02-20T14:07:02.273-08:00Pentaho Repository Queries<b><u>User Info</u></b><br />
<b><u><br /></u></b>
SELECT LOGIN, NAME, DESCRIPTION, ENABLED FROM R_USER<br />
<b><u><br /></u></b>
<b><u>Job Info</u></b><br />
<b><u><br /></u></b>
SELECT NAME, DESCRIPTION, JOB_VERSION, JOB_STATUS, CREATED_USER, CREATED_DATE, MODIFIED_USER, MODIFIED_DATE FROM R_JOB<br />
<br />
<b><u>Transformation Info</u></b><br />
<b><u><br /></u></b>
SELECT NAME, DESCRIPTION, TRANS_VERSION, TRANS_STATUS, CREATED_USER, CREATED_DATE, MODIFIED_USER, MODIFIED_DATE FROM R_TRANSFORMATION<br />
<a name='more'></a><br />
<b><u>Step Info</u></b><br />
<b><u><br /></u></b>
SELECT<br />
S.NAME STEP_NAME,<br />
S.DESCRIPTION,<br />
ST.DESCRIPTION AS STEP_TYPE,<br />
T.NAME AS TRANS_NAME<br />
FROM<br />
R_STEP S,<br />
R_STEP_TYPE ST,<br />
R_TRANSFORMATION T<br />
WHERE<br />
S.ID_STEP_TYPE = ST.ID_STEP_TYPE<br />
<br />
AND S.ID_TRANSFORMATION = T.ID_TRANSFORMATION<br />
<br />
<br />
<b><u>Job Entry Info</u></b><br />
<b><u><br /></u></b>
SELECT<br />
E.NAME JOB_ENTRY_NAME,<br />
E.DESCRIPTION,<br />
JT.DESCRIPTION AS ENTRY_TYPE,<br />
J.NAME AS JOB_NAME<br />
FROM<br />
R_JOBENTRY E,<br />
R_JOBENTRY_TYPE JT,<br />
R_JOB J<br />
WHERE<br />
E.ID_JOBENTRY_TYPE = JT.ID_JOBENTRY_TYPE<br />
AND E.ID_JOB = J.ID_JOB<br />
<br />
<br />
<b><u>DB Connection Info</u></b><br />
<b><u><br /></u></b>
SELECT<br />
T.DESCRIPTION DATABASE_TYPE,<br />
D.NAME NAME,<br />
D.HOST_NAME,<br />
D.DATABASE_NAME,<br />
D.PORT,<br />
D.USERNAME<br />
FROM<br />
R_DATABASE D,<br />
R_DATABASE_TYPE T<br />
WHERE<br />
<br />
D.ID_DATABASE_TYPE = T.ID_DATABASE_TYPE<br />
<br />
<b><u>Dependencies</u></b><br />
<b><u><br /></u></b>Check for all transformations using given table, say PRODUCT_MASTER<br />
<br />
SELECT<br />
S.NAME STEP_NAME,<br />
S.DESCRIPTION,<br />
ST.DESCRIPTION AS STEP_TYPE,<br />
T.NAME AS TRANS_NAME,<br />
SA.VALUE_STR<br />
FROM<br />
R_STEP S,<br />
R_STEP_TYPE ST,<br />
R_TRANSFORMATION T,<br />
R_STEP_ATTRIBUTE SA<br />
WHERE<br />
S.ID_STEP_TYPE = ST.ID_STEP_TYPE<br />
AND S.ID_TRANSFORMATION = T.ID_TRANSFORMATION<br />
AND SA.ID_TRANSFORMATION = T.ID_TRANSFORMATION<br />
AND SA.ID_STEP = S.ID_STEP<br />
AND SA.CODE = 'sql'<br />
and SA.VALUE_STR like '%PRODUCT_MASTER%'<br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjPnYDBuOof8mQBDDTEytDAR1ycXaendpSgORbZ5dCMwf_U1MlyHAALFEJyIlZuJWzfxvK1aoYbup4jDMoJy6XgNGWk2chcskWcSMMoi5mDEePK3mepSE37NPBLjyiRi3oowWuz74K80vc/s1600/pent11.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjPnYDBuOof8mQBDDTEytDAR1ycXaendpSgORbZ5dCMwf_U1MlyHAALFEJyIlZuJWzfxvK1aoYbup4jDMoJy6XgNGWk2chcskWcSMMoi5mDEePK3mepSE37NPBLjyiRi3oowWuz74K80vc/s1600/pent11.jpg" /></a></div>
<br />Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com0tag:blogger.com,1999:blog-7135267265798115356.post-43001345728988134422013-09-24T11:22:00.001-07:002014-02-20T13:55:30.685-08:00Pentaho Data Integration - PDI Installation for Windows<b><u>Step 1 : </u></b><br />
<div class="separator" style="clear: both; text-align: left;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjzTEuUvcaA75Ovuo7HDMqjHqe349yIIE7CWw1sFhjmr_C2PhHmH4xg-sIEYc53En49yDbfiYpQjQ5-6ELnG1YFAKuxMApRJ3TSrJGOfL3N5lI0L3PUjCPzUxiJbTue7P1i3cwVUxpDTdc/s1600/config.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEjzTEuUvcaA75Ovuo7HDMqjHqe349yIIE7CWw1sFhjmr_C2PhHmH4xg-sIEYc53En49yDbfiYpQjQ5-6ELnG1YFAKuxMApRJ3TSrJGOfL3N5lI0L3PUjCPzUxiJbTue7P1i3cwVUxpDTdc/s1600/config.jpg" height="1" width="1" /></a></div>
<br />
Download latest PDI version from http://pentaho.com/download.<br />
Choose 32 Bit or 64 Bit based on OS requirements.<br />
<br />
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh4JnMxhNNrF6y2sLjd_5QlCU2ZUJaCed3TNw4_VPBFf-FrByRqYQ7SNAOwcco1uikya-AGEZ9Tkxp_wRd0RVfRs0az93s5asWw90P-BHpIEOVx-tZG8-PKMm0v0BdUclme6zXm8BUQbrc/s1600/pdi_dload.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em; text-align: center;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEh4JnMxhNNrF6y2sLjd_5QlCU2ZUJaCed3TNw4_VPBFf-FrByRqYQ7SNAOwcco1uikya-AGEZ9Tkxp_wRd0RVfRs0az93s5asWw90P-BHpIEOVx-tZG8-PKMm0v0BdUclme6zXm8BUQbrc/s320/pdi_dload.jpg" height="175" width="320" /></a><br />
<br />
<a name='more'></a><br />
<b><u>Step 2 :</u></b><br />
<b><u><br /></u></b>
PDI 4.2.1 version used for this demo purpose.<br />
Click on pdi-4.2.1-GA-i386.exe and accept the license agreements.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiY3u69yVOzx8CrNv9eDS66jEt92y4I0MjKSB-HyuAaLZntOXuhlRwsAdcWh4OCLFTjNdMvkCE6cLd_tja8a2v1xsKrsSTZWYYh6yYGBWxZxzy3ecJewDuyx-sH5hsPjpA_ouMO2UFwXck/s1600/step1.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em; text-align: left;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiY3u69yVOzx8CrNv9eDS66jEt92y4I0MjKSB-HyuAaLZntOXuhlRwsAdcWh4OCLFTjNdMvkCE6cLd_tja8a2v1xsKrsSTZWYYh6yYGBWxZxzy3ecJewDuyx-sH5hsPjpA_ouMO2UFwXck/s320/step1.jpg" height="64" width="320" /></a></div>
<br />
<b><u><br /></u></b>
<br />
<br />
<br />
<br />
<br />
<b><u>Step 3 :</u></b><br />
<div>
<br /></div>
PDI enterprise edition set up will pop up. Please click next.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEijMHIVlpouQmhGMREQU3nLMSYKmfkwDGLk0cyezP79Nq2xbNUMJMQm-v6xZcqW_285B-YVdNzpCKXRrQY7lKVChmLN9rzVnQ0xUjWyuKRqsmnQiWeWOUQ7eBTC_SGeqEwgooTD5q4ve5A/s1600/step2.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em; text-align: left;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEijMHIVlpouQmhGMREQU3nLMSYKmfkwDGLk0cyezP79Nq2xbNUMJMQm-v6xZcqW_285B-YVdNzpCKXRrQY7lKVChmLN9rzVnQ0xUjWyuKRqsmnQiWeWOUQ7eBTC_SGeqEwgooTD5q4ve5A/s320/step2.jpg" height="240" width="320" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b><u><br /></u></b>
<b><u><br /></u></b>
<b><u><br /></u></b>
<b><u><br /></u></b>
<b><u>Step 4 :</u></b><br />
<div>
<div style="text-align: left;">
Set up Administration account for Pentaho Enterprise Console.</div>
</div>
<div style="text-align: left;">
Enter password. Note down the password for later reference.</div>
<div style="text-align: left;">
<br /></div>
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhyMGlwY2-zfsXmjCk0V4kaXXeyKAJGPF8lGfIg7-9PR_3esyRdPW7iIEAIAY9B_oavmkFTAmgkPmwp98zbbY7vPnizBIf3iVdLVYF6uGDT1Peba95vZkw_J6hgcr2Nztbqbu06_jSmCgM/s1600/step4.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em; text-align: left;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhyMGlwY2-zfsXmjCk0V4kaXXeyKAJGPF8lGfIg7-9PR_3esyRdPW7iIEAIAY9B_oavmkFTAmgkPmwp98zbbY7vPnizBIf3iVdLVYF6uGDT1Peba95vZkw_J6hgcr2Nztbqbu06_jSmCgM/s320/step4.jpg" height="241" width="320" /></a></div>
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<br />
<b><u>Step 5 :</u></b><br />
<div>
<br /></div>
Set up will begin installing Pentaho applications.<br />
Evaluation licenses will be auto generated and installed for trial versions.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgJxFFTyAeq7NVxnrzH6DFhUxqB-ey8KweEnUI2flw_XT0gwioarel_oEFCMhQylXEbkwbbqdFVA2nP4tv_ITy0p_vBRS2mDsUkOIqhhqIzYG2yd6qRv1QdyQiJjpL24ClMaaLi3cPuzHw/s1600/step5.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em; text-align: left;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEgJxFFTyAeq7NVxnrzH6DFhUxqB-ey8KweEnUI2flw_XT0gwioarel_oEFCMhQylXEbkwbbqdFVA2nP4tv_ITy0p_vBRS2mDsUkOIqhhqIzYG2yd6qRv1QdyQiJjpL24ClMaaLi3cPuzHw/s320/step5.jpg" height="240" width="320" /></a></div>
<br />
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<br />
<b><u>Step 6 :</u></b><br />
<b><u><br /></u></b>
Installation Complete - Enterprise Console and Spoon can be launched for configuration.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhmVJHlMWBTCO7pyCD5QemYImD83u9A3vVsv0JnHAUJAlaRWnI9JfmIrQJrPjf7D7RYAjPRypmpna4U7DMsMvIYw3oiX48UhCAkgpjEeGZRPBQPJoDsRgcu_7oEx1yswmFz-KTLyviNHBQ/s1600/step7.jpg" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em; text-align: left;"><img border="0" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhmVJHlMWBTCO7pyCD5QemYImD83u9A3vVsv0JnHAUJAlaRWnI9JfmIrQJrPjf7D7RYAjPRypmpna4U7DMsMvIYw3oiX48UhCAkgpjEeGZRPBQPJoDsRgcu_7oEx1yswmFz-KTLyviNHBQ/s320/step7.jpg" height="242" width="320" /></a></div>
<br />
<div class="separator" style="clear: both; text-align: center;">
<br /></div>
<br />
<br />Aravindhttp://www.blogger.com/profile/05617952193085138519noreply@blogger.com1