
What are useful resources for a newcomer to data analysis techniques?

This article is taken from http://searchcloudcomputing.techtarget.com/answer/
It was written by Dan Sullivan.
---------------------------------------------------------------------------------------------------

Useful resources for a newcomer to data analysis techniques:

 

Many organizations are adept at collecting data, but the real value is only realized when the data is analyzed. Creating and maintaining a data analysis practice will require support from cloud administrators, as well as data analysts. Cloud administrators will be called on to configure systems, evaluate architectures and maintain infrastructure for data analysts. The more you know about the practice of data analysis, the better you can support it.

Combining books and online tutorials with hands-on work in various tools can help you dive into data analysis while staying grounded in your own real-world data analysis problems.

Many data analysis techniques are taken from statistics and machine learning. Coursera.org, a free resource for massive open online courses, offers courses in computing for data analysis, mathematical modeling and statistics. Andrew Ng's course on machine learning at Coursera is well designed for students new to the topic.

Philipp Janert's book Data Analysis with Open Source Tools introduces statistical techniques along with open source tools. Wes McKinney's Python for Data Analysis: Agile Tools for Real-World Data is a good introduction to working with data in Python.

R is a widely used open source statistical analysis tool with a wide set of add-on packages. The R Tutorial is a gentle introduction to R, though it also includes some more advanced articles. The pandas Python package has features comparable to R, and it is a good fit for Python developers who want to use Python for collecting, formatting and analyzing data.
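To make the collect, format and analyze workflow concrete, here is a minimal sketch that uses only Python's standard library, so it runs without installing anything. The sample data, the column names (`server`, `cpu_percent`) and the helper functions are all hypothetical and are not taken from the original article. A package such as pandas expresses the same steps far more compactly, which is a large part of its appeal.

```python
import csv
import io
import statistics
from collections import defaultdict

# Hypothetical sample data standing in for a real CSV export.
# One reading for web-2 is deliberately missing.
RAW_CSV = """server,cpu_percent
web-1,41.5
web-1,55.0
web-2,
web-2,72.0
web-2,68.0
"""

def load_rows(text):
    """Collect: parse CSV text into a list of dicts, one per row."""
    return list(csv.DictReader(io.StringIO(text)))

def clean_rows(rows):
    """Format: drop rows with a missing reading and convert the rest to float."""
    cleaned = []
    for row in rows:
        value = row["cpu_percent"].strip()
        if value:
            cleaned.append({"server": row["server"], "cpu_percent": float(value)})
    return cleaned

def mean_by_server(rows):
    """Analyze: average the readings for each server."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["server"]].append(row["cpu_percent"])
    return {server: statistics.mean(values) for server, values in groups.items()}

summary = mean_by_server(clean_rows(load_rows(RAW_CSV)))
print(summary)
```

Keeping the three stages in separate functions makes it easy to swap in a real file or a different statistic later without touching the rest of the pipeline.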

Getting started with data-mining tools does not have to be intimidating. RapidMiner is an open source data-mining tool with an easy-to-use interface and a wide collection of research tools available.

Visualization tools such as Tableau Software can help you better understand large data sets with many variables. Tableau is a paid product, but a free trial is available if you want to give it a try.
