Skip to main content

What are useful resources for a newcomer to data analysis techniques?

This article are taken from http://searchcloudcomputing.techtarget.com/answer/
It is written by Dan Sullivan
---------------------------------------------------------------------------------------------------

The useful resources for a newcomer to data analysis techniques:

 

Many organizations are adept at collecting data, but the real value is only realized when the data is analyzed. Creating and maintaining a data analysis practice will require support from cloud administrators, as well as data analysts. Cloud administrators will be called on to configure systems, evaluate architectures and maintain infrastructure for data analysts. The more you know about the practice of data analysis, the better you can support it.

Using a combination of books and online tutorials while working with various tools can help you dive into data analysis while staying linked to your own real-world data analysis problems.

Many data analysis techniques are taken from statistics and machine learning. Cousera.org, the free resource for massive online courses, offers courses in computing for data analysis, mathematical modeling and statistics. Andrew Ng's course on machine learning at Cousera is well designed for students new to the topic.

Philipp Janert's book Data Analysis with Open Source Tools introduces statistical techniques along with open source tools. Wes McKinney's Python for Data Analysis: Agile Tools for Real-World Data is a good introduction to working with data in Python.

R is a widely used open source statistical analysis tool with a wide set of add-on packages. The R Tutorial is a gentle introduction to R, but it has some more advanced articles as well. The Pandas Python package has features comparable to R, and it is a good fit for Python developers that want to use Python for collecting, formatting and analyzing data.

Getting started with data-mining tools does not have to be intimidating. RapidMiner is an open source data-mining tool with an easy-to-use interface and a wide collection of research tools available.

Visualization tools such as Tableau Software, a visualization service, can help you better understand large data sets with many variables. This is a fee-for-service product, but there is a free trial if you want to give it a try.

Comments

Popular posts from this blog

An attempt was made to insert a node where it is not permitted

Do you face this Error while you are writing code to generate xml file from java? Exception in thread "main" org.w3c.dom.DOMException : HIERARCHY_REQUEST_ERR: An attempt was made to insert a node where it is not permitted.        at com.sun.org.apache.xerces.internal.dom.CoreDocumentImpl.insertBefore(Unknown Source)        at com.sun.org.apache.xerces.internal.dom.NodeImpl.appendChild(Unknown Source)        at generatexml.WriteXMLFile.main( WriteXMLFile.java:30 ) Well the answer is: Don't insert the node where it isn't permitted. Change your generated directory file path from 'C' to other directory ex, D or to any directory you have. Make sure the ‘appendChild’ is referring to the right element. Don’t appending twice, only make it once. Ex, //Writetoxml.java   Element rootElement = doc . createElement ( " Company " );   doc . appendChild ( rootElement );                Element subElement = doc . cre

Retrieve Data from Database and Compare it with user input using Java

In this Lesson we will create a form page " form.jsp " that takes the user email. After that we will check the user existence in our DB. If the user email stored in the DB, a welcoming page will be opened to him/her. If the user is a new user then a message will be displayed that tells him/her this email is not stored in our DB. Note: I use access 2013 database and Eclipse Juno     Basic step, Create a new project: Open Eclipse then click on File > New > Other > Web > Dynamic Web Project. First, Load the DB class: In this step we will connect with database so we will gather all its code in a java class named " DBConnection.java " under a package called " code ". Expand your project then right click on java resources > New > Package . After-that Give a name for your package ex, code   Right click on your created new package that called code > New > Class.   After-that Give a name for your Class ex, DBConnection.  

Do you want to know about your computer in one window screen?

System Toolbox (Sys Toolbox) If you want to know information about your computer from processor, drivers, motherboard, memory, operating system,...and more, you can depend on this software. Sys Toolbox (see screen shoot from the software) provides a software and hardware information for windows operating system. It is a simple software that you can use it to know about your PC. To run the software, you need to: Extract the zipped software.  After that, right click on the software icon and choose "Run as administrator". Finally, you will find a pop up screen telling you information about you PC. To go to the software site: http://sys-toolbox-pro.soft112.com/ To Download the software: Download