50 speakers audio data with length more than 1 hour for each. Further, data converted to wav format, 16KHz, mono channel and is split into 1min chunks. This dataset can be used for speaker recognition kind of problems. This dataset was scraped from YouTube and Librivox.