
large data set and I need to do the following in R


Attachment: 2013-08-20_225351_new_data_-_august_16_2013.xlsx

I have a large data set and I need to do the following in R:

1. To do: Identify rows in the data that have the following error: for some child IDs, a friend ID was written more than once in the same row. Find all of these errors and produce an Excel spreadsheet listing the row numbers where the error occurs (do not fix them; Amori needs to fix them).

2. To do: Fix the following error: when the child is by him/herself, the quality variable should always be N/A. The data file is attached.

Welcome to JA!

I can write a script to make these changes. So, should only R be used or could I use another programming language to make these corrections?

Could you also please send one example of each requirement, so that I can be sure?

Customer: replied 4 years ago.

That would be great!


R would be preferred, as other scripts that deal with this data set have already been written in R and it would be easier to integrate all of them.


To Do #1: For example, row 7858 might have session 1, day 10, childid 664, coder kanesha. Columns f1 through f10 hold the IDs of the children that childid 664 was playing with at the time. If, for example, 506 is written twice anywhere in the f1-f10 columns, the script should report this as an error in an Excel spreadsheet, indicating the row and columns where it was found. (This is not a real example from the data, but it gives you an idea of what needs to be looked for.)
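A minimal sketch of the check described in To Do #1, not the expert's actual script. It assumes the friend columns are named f1, f2, ... as in the description; the toy data frame, the helper `dup_columns`, and the output file name are hypothetical and only illustrate the idea.

```r
# Hypothetical toy data; f1..f3 stand in for f1..f10 in the real file.
toy <- data.frame(
  childid = c(664, 501, 702),
  f1 = c(506, NA, 610),
  f2 = c(512, NA, 611),
  f3 = c(506, NA, NA)
)

# Pick out the friend columns by name (assumes the f1, f2, ... naming).
friend_cols <- grep("^f[0-9]+$", names(toy), value = TRUE)

# For one row, return the names of the columns whose ID appears more than
# once in that row. NA entries are ignored, so empty slots never count.
dup_columns <- function(ids) {
  ids <- unlist(ids)
  repeated <- !is.na(ids) &
    (duplicated(ids) | duplicated(ids, fromLast = TRUE))
  names(ids)[repeated]
}

dup_list <- lapply(seq_len(nrow(toy)),
                   function(i) dup_columns(toy[i, friend_cols]))
has_dup <- lengths(dup_list) > 0

# One line per offending row. Note that data-frame row i is spreadsheet
# row i + 1 when the CSV has a single header line.
errors <- data.frame(
  row = which(has_dup),
  childid = toy$childid[has_dup],
  columns = vapply(dup_list[has_dup], paste, character(1), collapse = ", "),
  stringsAsFactors = FALSE
)
print(errors)

# To hand the list to whoever fixes the data (hypothetical file name):
# write.csv(errors, "duplicate_friend_errors.csv", row.names = FALSE)
```

The rows are only reported, not changed, which matches the request that the errors be fixed by hand.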


To Do #2: For example, say row 2 has session 1, day 3, childid 501, coder Katie. If all of the columns f1-f10 are NA, then the quality column should also say NA (not 2 or 1). Note: this is also not a real example from the data.
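A minimal sketch of the fix described in To Do #2, again not the expert's actual script. It assumes the friend columns are named f1, f2, ..., that the quality column is literally called `quality`, and that missing values were read in as NA; the toy data frame is hypothetical.

```r
# Hypothetical toy data; f1..f2 stand in for f1..f10 in the real file.
toy <- data.frame(
  childid = c(501, 664, 702),
  f1 = c(NA, 506, NA),
  f2 = c(NA, NA, NA),
  quality = c(2, 1, NA)
)

friend_cols <- grep("^f[0-9]+$", names(toy), value = TRUE)

# A child is alone when every friend column in the row is NA.
alone <- rowSums(!is.na(toy[, friend_cols, drop = FALSE])) == 0

# Rows that actually carry the error (alone, but quality was filled in).
needs_fix <- alone & !is.na(toy$quality)

# Apply the fix: quality becomes NA wherever the child is alone.
toy$quality[alone] <- NA
print(toy)
```

If the CSV stores missing values as the text "N/A" rather than "NA", `read.csv` will not treat them as missing by default; passing `na.strings = c("NA", "N/A")` when reading the file is one way to handle that.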


Thank you! Hope this was a clear explanation. Please let me know if you need any more details.

Ok, got it. I'll include a script within the Excel file to do so. You can run the script using a shortcut key I'll assign; it will then search the active spreadsheet for these two errors and generate the results.
Customer: replied 4 years ago.
Perfect, thank you. Will I be able to see and run the code in R studio as well as the shortcut key?
Not sure about R-studio. But you would be able to run it from inside Microsoft Excel.
Customer: replied 4 years ago.
Hmmm, the only problem with that is that this code needs to be integrated into a larger R script.
You can open the excel file using 'Open existing file...' option in R-studio, then press the shortcut key to run script inside excel and save it. Would it not help?
Customer: replied 4 years ago.
Unfortunately no, I need it written in R.
Alright. How are you importing the Excel file into R?
Customer: replied 4 years ago.

With the following (the Excel file must first be saved as a CSV):


# XXXXX working directory

#read in excel file
setwd("C:/Users/Jenny PC/Documents/Academics/Amori and Vika") #this is your working directory where the data file is saved
data = read.csv("New Data - August 16 2013.csv")


and then writing the output spreadsheet with the following:

write.csv(data,"C:/Users/Jenny PC/Documents/Academics/Amori and Vika/Output Stuff/New Data.csv")


This is the directory where you want the output produced, followed by the name of the CSV you want.

Ok, I'll prepare the script in R
Customer: replied 4 years ago.
Thank you!
You're very much welcome!

Customer: replied 4 years ago.

Are you able to provide comments on your code so that I am able to know what each line is doing?

Ok, I'll do that.
Customer: replied 4 years ago.

Much appreciated!

Customer: replied 4 years ago.

Also, did you do part 2 of this as well, or just the first part?

I did both parts. I've updated the script with comments. Please download it again from the previous link.

Customer: replied 4 years ago.

Thank you! You were a great help!

You're most welcome!
Customer: replied 4 years ago.
Hi RRJha,

I have some more R scripts that need to be written. Would you be interested?
Sure, I'd be glad to help. You may post the question here.