Orchestrating video transcoding in Ruby: A story
Michał Matyas
@nerdblogpl
https://nerdblog.pl
You can watch the slides live on your phone/laptop
https://tiny.cc/transcoding
I will avoid talking about the business side of the project.
Please don't ask, thank you!
Introduction
There was once a project...
Lesson #1
Hindsight 20/20
It's easy to look back and think "this should've been obvious" but you usually lack all the knowledge and experience at the time.
class Video < ActiveRecord::Base
mount_uploader VideoUploader
process_in_background UploadWorker
before_save
before_processing
Video.status = processing
carrierwave-video (ffmpeg)
after_processing_success
after_processing_failure
Video.status = ready
Video.status = failed
CALLBACK ALL THINGS!!!
(please don't)
Lesson #2
Avoid callbacks
Callbacks make your code harder to reason and harder to isolate, slowly turning everything into a tightly coupled co-dependent mess.
Carrierwave versions
include CarrierWave::Video
include ::CarrierWave::Backgrounder::Delay
storage :file
version :mp4 do
process :encode_video => [:mp4]
end
version :webm do
process :encode_video => [:webm]
end
Carrierwave versions
support_format :mp4_1080p, {
resolution: '1920x1080',
progress: :on_progress_1080p,
if: :allow_1080p?
}
support_format :mp4_720p, {
resolution: '1920x1080',
progress: :on_progress_720p
}
support_format :mp4_480p, {
resolution: '852x480',
custom: '-preset ultrafast -g 5',
progress: :on_progress_480p
}
Lesson #3
Don't use Carrierwave versions
Or at least don't use them for anything more complex. They are good for other things (probably).
Consider using Shrine instead.
Video
kind: "720p"
Version
kind: "480p"
Version
kind: nil
Version
kind: "720p"
special: true
Version
kind: "480p"
special: true
Version
kind: nil
special: true
Version
class Video < ActiveRecord::Base
mount_uploader VideoUploader
process_in_background UploadWorker
before_save
before_processing
Video.status = processing
carrierwave-video (ffmpeg)
after_processing_success
after_processing_failure
Video.status = ready
Video.status = failed
class Version < ActiveRecord::Base
mount_uploader VersionUploader
process_in_background VersionUploadWorker
before_save
before_processing
Version.status = processing
carrierwave-video (ffmpeg)
after_processing_success
after_processing_failure
Version.status = ready
Version.status = failed
class Version < ActiveRecord::Base
mount_uploader VersionUploader
process_in_background VersionUploadWorker
before_save
before_processing
Version.status = processing
carrierwave-video (ffmpeg)
after_processing_success
after_processing_failure
Version.status = ready
Version.status = failed
class Version < ActiveRecord::Base
mount_uploader VersionUploader
process_in_background VersionUploadWorker
before_save
before_processing
Version.status = processing
carrierwave-video (ffmpeg)
after_processing_success
after_processing_failure
Version.status = ready
Version.status = failed
class Video < ActiveRecord::Base
Video.status = processing
Video.status = ready
Video
original version gets uploaded
ProcessingWorker
checks for existing versions
store in local cache
creates missing ones
starts processing based on the priority
(low quality video - higher priority because it's gonna be ready fastest)
uploads to S3
(in a separate worker once the version is ready)
processing uses streamio-ffmpeg directly to pass custom arguments to ffmpeg based on the video format and allows transcoding from other versions rather than original file (much much faster)
Lesson #4
Simplest solutions are often best solutions
We would have avoided a lot of pain if we didn't try to do things The Rails Way™ but went with the simplest solution instead.
You live and learn!
But why didn't you use...
- AWS Lambda - 15 minute of max execution time
- Zencoder, Amazon Elastic Transcoder etc. - expensive
- Docker - not that popular (and well supported in production) at the time
Let's talk about MP4
- FTYP - File identification
- MDAT - media data (streams)
- MOOV - metadata
Tip #1
Copy streams!
When transcoding things, the best approach is to copy streams and match codecs as much as possible because it's way way faster.
MOOV IS MADE OF ATOMS
- Fixed parameters of the file
- Specific pointers for each frame of data and audio
MOOV IS MADE OF ATOMS
- TRKH - file times, duration, speed etc.
- MDIA - Media Data
- STCO - start of each video/audio chunk
- STSC - Sample to chunk atom
- STSZ - Sample size atom
Tip #2
Optimize for streaming
ffmpeg has arguments that make video better at streaming.
The most important:
-movflags +faststart
Tip #3
Bring Your Own Arguments
When using streamio-ffmpeg just ignore all the DSL they have and go for direct passing of as many arguments as you can.
Tip #4
There are no universal solutions
When transcoding videos from network, sometimes it's faster to download-then-process, sometimes to process on the fly. It heavily depends on the codec though.
Tip #5
Use presets
ffmpeg already has presets for most common configuration options but it's a balance between speed and quality. Use fast presets for low resolution videos.
Tip #6
Use profiles
H.264 has profiles (sets of codec features) but not every device will support all of them.
Go with the highest one you can afford.
Tip #7
Always convert to YUV420
There are different pixel formats that define how color information is stored. Use -pix_fmt yuv420 for max compatibility in browsers.
Tip #8
Collect metadata
It is always a good idea to collect metadata from ffprobe for both the incoming video and processed versions to look for clues when things break
Tip #9
Trust but verify
Sometimes file can be cut during transcoding and still technically be "valid", even if incomplete.
BUT...
Tip #10
Trust but verify manually
Be super-careful about invalidating the video or audio based on the duration given from ffprobe. It's sometimes approximated without a warning.
That's it, folks!
@nerdblogpl
https://nerdblog.pl
sources:
- https://trac.ffmpeg.org/wiki/Encode/H.264
- https://www.dfrws.org/sites/default/files/session-files/pres-video_authentication_using_file_structure_and_metadata.pdf
- http://xhelmboyx.tripod.com/formats/mp4-layout.txt
- https://www.cnwrecovery.com/manual/MP4FileStructure.html
Transcoding videos in Ruby: A story
By Michał Matyas
Transcoding videos in Ruby: A story
- 2,598