-
Notifications
You must be signed in to change notification settings - Fork 579
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve error message in spark tools when trying to access a local file from other nodes #1417
Comments
this is on yarn on dataflow01 |
This same issue is coming up in a number of different ways. What's happening is that we ask for the
|
If you |
I was assuming that this would be fixed by #1433, but it only fixed the inverse of this problem. It's possible now to load HDFS files from the local runner using the full namenode path i.e. Loading files with the sparkRunner and yarn-client is still failing. We're getting a new error now though.
|
We've been hit with a collective case of confusion. Of course this is going to fail. We're accessing a file on local. This is not visible to the executors on the other nodes. Thus the explosion. |
Doh! Perhaps we could improve the error message in this case? |
Yes. We desperately need to improve error messages for hadoop bam. |
Updated the title of this ticket to clarify that the task is now just to improve the error message, not fix an actual bug. |
this is the same as #1452. The bug is in hadoop-bam - let's not put a bandaid for that here. |
opened HadoopGenomics/Hadoop-BAM#79 blocked until that is fixed. not alpha-1 |
@akiezun I created a fix for HadoopGenomics/Hadoop-BAM#79 in HadoopGenomics/Hadoop-BAM#99, which should fix this. Can you take a look? |
yes, though not today |
@tomwhite So I'm running on the latest gatk which uses hadoop-bam 7.6.0 (which i think includes those fixes) and I still get this error. I'm running on a dataproc cluster and my exact commandline is:
the file Can you reproduce it too? What do you get? |
assigning to @droazen for dispatch |
I'm almost certain this used to work.
the error is
It's fine when running a LOCAL runner, or when the file is on HDFS.
When resolving the ticket, make sure to devise a way (or at least enter a ticket) to prevent this from happening again - ie some way to discover this kind of problem.
The text was updated successfully, but these errors were encountered: