01-17-2021 03:15 AM - edited 01-17-2021 03:22 AM
Hi!
I have a transcript which I read from during the recording. Later Udemy creates VTT, which ideally should be the transcript again with added timings. However due to the course, many terms are used, that the VTT generator not knows and hence it messes up almost every generated sentence. In addition, I am not a native English speaker.
Therefore I came up with a tool that merges the VTT timings with the transcript using some kind of fuzzy logic to fine the best matching parts. So for instance the content of the VTT generated by Udemy such as
03:42.650 --> 03:50.570
And you may need an original, preferably the discontinued Dumela, not on the right or the Autoline
03:51.080 --> 03:51.710
on the left.
03:53.000 --> 04:00.080
If you go to a store, watch out that the MCU that is definitely featuring HP, which is located here.
04:01.110 --> 04:04.380
Is not Soyland to the board and can be removed.
As you can see, it is a mess and I am sure, it will definitely not help in understanding the content. The tool searches the transcript for matching sentences, finds them in this paragraph
Slide 6
And you will need an Arduino. Preferably the discontinued Duemilanove on the right or the Arduino Uno on the left. If you go to a computer store, watch out that the MCU, that big black processor in the middle is like that and not soldered to the board. Otherwise you will need some separate ATmega328Ps. In this course, you will need at least 2 of those Arduinos, better would be 3.
and converts this mess into
00:03:42.650 --> 00:03:50.570
And you will need an Arduino. Preferably the discontinued Duemilanove on the right or the Arduino Uno
00:03:51.080 --> 00:03:51.710
on the left.
00:03:53.000 --> 00:04:00.080
If you go to a computer store, watch out that the MCU, that big black processor in the middle is like that
00:04:01.110 --> 00:04:04.380
and not soldered to the board.
It is not perfect yet and is quite slow (it takes like an hour for 20 hours of lecture material on machines with 128 CPUs in total). My question is, do you know a tool which can do the same just with higher accuracy and faster?
Thanks,
Alex
01-20-2021 06:50 PM
Hey, I dont know it yet, tell us the tool which you are using. I am okay with slow speed
01-21-2021 01:00 AM
Hey HSE-Trainer,
I wrote it by myself. Currently it is a small script in Python that has been put together. I did a quick search could not find anything and just wrote it. However, I might have overlooked something in my search, so I wanted to ask, if a tool like that already exists.
If you are interested in using it, I have to make the source more beautiful first. 🙂
Alex
User | Likes Count |
---|---|
7 | |
3 | |
3 | |
3 | |
3 |
Ooops, it looks like you’re not a registered Udemy instructor. Want to become an instructor? We’d love to have you!
Become an instructor Already an instructor? Sign in
You can join the Instructor Club after you publish your first course. In the meantime, you’ll find plenty of help and advice in Studio U.