RaySin on GitHub

Course : Deep Learning for Human Language Processing - Speech Separation

2020-04-12
NLP

Watch the NTU cource by youtube.

TasNet - Time-domain Audio Separation Network

Architecture

  • Encoder

    • 512-d : 不一定要positive

  • Decoder

    • 跟Encoder互為Inverse的效果沒比較好

  • Separator

    • Network Compression
      • Depthwise Separable Convolution
        • Steps
          1. Depthwise Convolution
            • Filter number = Input channel number
            • Each filter only consider on channel
            • The filters are k x k matrices
            • There is no interaction between channels
          2. Pointwise Convolution
            • 1 x 1 filter
        • Application
          • SqueezNet
          • MobileNet
          • ShuffleNet
          • Xception

Experiment

Reference


Comments

Content
Translator
Google AdSense
BloggerAds