Graphpad Import Excel

0 views
Skip to first unread message

Zebedeo Konig

unread,
Aug 4, 2024, 7:47:37 PM8/4/24
to klasattehmang
Ihave already calculated p values and log2 fold changes for my proteomics data on excel. I just don't know where to generate my volcano plot. Some simple step by step advice would be really really appreciated on an easy to use software. I cannot use R - its impossible.

I'm pretty sure that neither Excel nor Graphpad Prism can do volcano plots easily (though I found a tutorial for doing them in Excel using an add-in called XLSTAT (costs money) just by googling). Why do you need a volcano plot? If you are doing microarray analysis, then CLC genomics workbench has them natively built in.


Note that if you are wanting to get into the bioinformatics field in any significant manner, then R really is the way to go, it takes a bit of fiddling to learn, but there are many many tutorials and examples on the web, and lots of community support on many sites too.


I was hoping I could find a simple software where I could just input my calculated p values and LOG2 fold changes from Excel. Ive looked at the online R tutorials but even the most basic functions require so much fiddling around. Im not even sure how I can input the vast amount of data I have into R from excel?


R isn't that hard - it just looks it at first. It helps to know that there are tons of people doing this, so you can just google the problem you have and see if it works for you. If you want to go this route, get R from your local mirror, and get the desktop version of Rstudio, it will make your life easier.


Once you have installed and opened, you should see 4 windows - two on the left and two on the right. The two on the right are for data and outputs, while the two on the left are for input commands. The top left is if you want to program (which you do, so that you use the same commands to do all your analysis, and you don't have to remember exactly what you typed each time), the bottom left is the "execution" area, where the commands are run.


If you have an excel file and want to get it into Rstudio... On the top right window there should be a button saying "import dataset". Click on it and it'll give you some options, use the one appropriate to your file format, browse to your file, it should preview your file, then click import (bottom right).


This should open a version of the file in the top left and run some commands in the bottom left. Blank cells in the file should be filled with "NA". The command should look something like: your_file_name


Now, here's where it gets a little more tricky - you will need some packages (these are sub-programs within R that do small things for particular purposes). Under the "tools" menu, choose install packages, and type "tidyverse" into the "Packages" section. Tidyverse is a bunch of packages that help arrange data in R and includes a nice way of graphing things called "ggplot"


For the next bit, I'm going to assume you have done the stats and have columns to match that are labeled with "protein", "log2foldchange", "pvalue", and "padjusted". It is also helpful to have a column called "threshold" that indicates those that are statistically significant (TRUE (p 0.05) in the column - remember capitals are important!).


In the bottom left type (copy and paste if you prefer), then hit enter after each line if nothing happens. This loads the packages to R so it can use them. You will need to do these lines each time you try to use this program, but once it is loaded you don't need to do it again. First is the tidyverse. and then loads readxl (which is used to load your file). Text following "#" are comments - this can contain anything, and is used to help remember what you have done and why! Also note that R doesn't like spaces in file names or titles of columns. You can use something like "my_file" but not "my file". If you have spaces in your excel column names, either remove them, or R will replace the space with a "." when loading. You can also type/copy & paste this into the top left and then save (it is then a mini-program!)


This should pop open a window to choose your working file and then set the working directory to where that file is, so that any outputs (e.g. graphs) will be returned to the same location as your initial file - it's bad programming form, but useful for you.


Now, load the file into R, change the "your_file" and "file.xlsx" to something memorable for you. You will need to change these file names each time you analyze a new file, so that you don't overwrite the data:


This makes a graph file called "your_volcano_plot" (change as necessary) and assigns it a bunch of features based on your column names. Change the "TITLE OF YOUR PLOT", it is what comes out as the title on your plot. Now you can't see this plot just yet...


This should open the plot viewer in bottom right, so that you can have a look at your volcano plot. There is one more step so that you have the plot output as you want it - a file that you can work with to view in another program.


This outputs the file to your working directory (remember setting it near the start) - it'll be in pdf format. You can change as you want to jpg, png, tiff, bmp etc. I have specified a size, you can adjust those as needed. These are in inches currently, change as you like.


If you saved this as a program in the top left, then return the cursor to the top line of the program (before the line starting "library"). Click the "run" button at the top right of the program window. You will need to click run for most lines of the program, until it reaches the bottom of the program. You should now have a volcano plot. If it doesn't work - google the errors or let me know and I'll try to help.


Comments:I have used GraphPad to calculate IC50, EC50 in fluorescence-based assays for my scientific project. I've also generated my charts, figures and reports directly to publish in peer-reviewed journals because of their quality. It's an amazing software and it allowed me to transition from Excel and Origin and only use GraphPad now. However it's a bit heavy for my laptop.


GraphPad allows you to have one month free trial with all the features. That's an incredible advantage for people who's learning and selecting their "go-to" statistic software. It's pretty easy to import data and replicates and perform analysis over these replicates. I've tried other software before and GraphPad is way easier to use and complete. Figures are amazing and you can easily add standard deviation, which is a must to in scientific publications.


GraphPad tutorial is good and very informative. However, IC50 calculation tutorials could be improved adding real life examples with common absorbance or fluorescence experiments. Also, dose-response curves deserve more information and details about real life examples as well.About the software specifically, I would suggest to make it lighter, or develop a light version. My computer has 8GB RAM and good processor and still takes a lot to generate charts and analysis.


Comments:Analyzing datasets containing thousands of data points has never been easier! An easy to use click interface, beautiful publication-ready customizable figures, and all the statistical details you need in a click of a button.


This software does not offer scripting capability that is as useful as R or python. That being said, this software is designed to offer a click interface, and not a command line interface. It is meant to be easy to use with a click interface. It would be nice if both a command line and a click interface can co-exist for this software.


Comments:The software is good, but I have been over charged for two license at the same time and they never refunded me. I don't recommend anyone to get involve with graphpad licensing. Don't go for it except your company purchase the license because if they charge you extra, there is not even a phone number to call to solve the issue. You have to wait for weeks to hear back from them, when you are money is actually gone.


The easy to use statistical analysis where the uses for each test are clearly described so that you can apply the right test to your data set. This way you not only use the right test but understand why that test is applicable to your type of data.


Comments: I have used GraphPad to calculate IC50, EC50 in fluorescence-based assays for my scientific project. I've also generated my charts, figures and reports directly to publish in peer-reviewed journals because of their quality. It's an amazing software and it allowed me to transition from Excel and Origin and only use GraphPad now. However it's a bit heavy for my laptop.


GraphPad tutorial is good and very informative. However, IC50 calculation tutorials could be improved adding real life examples with common absorbance or fluorescence experiments. Also, dose-response curves deserve more information and details about real life examples as well. About the software specifically, I would suggest to make it lighter, or develop a light version. My computer has 8GB RAM and good processor and still takes a lot to generate charts and analysis.


Comments: Analyzing datasets containing thousands of data points has never been easier! An easy to use click interface, beautiful publication-ready customizable figures, and all the statistical details you need in a click of a button.

3a8082e126
Reply all
Reply to author
Forward
0 new messages