Data: Do You Need to Collect It?
There is a wealth of data available, much of which has been collected through large surveys and medical studies, but has yet to be analyzed. Before deciding that you must collect your own data, consider:
- Is the information you want already available in other datasets that have yet to be analyzed?
- Are you planning to measure the health outcome or predictor measures in a way that has not been done before?
- Do you have the funds to collect your data, or would it be more reasonable to do secondary data analyses?
Collecting data is an expensive endeavor, particularly for students who typically have limited funds and time limits for the projects they choose to undertake. If the measures you want to assess (including the health outcome and the potential predictors of that outcome) are available in an existing dataset, secondary data analyses may be more appropriate for you.
Data Collection Decisions
If you decide there is a need to collect your data, carefully consider what can influence the data you can collect. Any type of data collection, whether quantitative, qualitative, or a combination of both, require significant planning and detail. Consider the following:
- What patients or target population do you want to examine?
- Do you have a questionnaire that you want to deliver? Have the questions been validated in your target population?
- How will you establish your sample from your target population? Will this require sophisticated sampling techniques that you need help with?
- Have other studies examined the reliability and validity of the measures you want to use?
- Are you proposing a new measure of a health outcome that should be compared to other previously established measures?
- Will you have multiple interviewers/data collectors? How will you measure inter-rater and intra-rater reliability?
- Have you received institutional review board approval for the study you want to do?
- Have you thought about what you will do if the data collection does not go as planned (What will you do if you are unable to get enough participants within your expected time frame? What will you do if many subjects drop out of your study)?
These types of decisions can strongly impact not only whether you are able to collect data, but the quality of any data you are able to collect, as well. Although collecting data in itself may not be the most difficult endeavor you face, having high quality data that allows you to make inferences to the target population of interest with respect to the health outcome you study is very difficult, and requires substantial planning.