[Script Info] Title: [Events] Format: Layer, Start, End, Style, Name, MarginL, MarginR, MarginV, Effect, Text Dialogue: 0,0:00:04.28,0:00:09.28,Default,,0000,0000,0000,,Technical evolution of Wikipedia by Brion Vibber - Former CTO of Wikipedia Dialogue: 0,0:00:09.28,0:00:11.54,Default,,0000,0000,0000,,Good evening, everyone. Dialogue: 0,0:00:11.54,0:00:15.42,Default,,0000,0000,0000,,It's my pleasure to present to you, Brion Vibber. Dialogue: 0,0:00:15.42,0:00:23.48,Default,,0000,0000,0000,,For years, he's worked at the Wikimedia Foundation as their chief technical officer. Dialogue: 0,0:00:23.48,0:00:31.24,Default,,0000,0000,0000,,and I'm very happy, that Lu [the German translator] could come. He runs Esperantoland.org. Dialogue: 0,0:00:31.24,0:00:35.17,Default,,0000,0000,0000,,I'll give the floor to Brion. Dialogue: 0,0:00:35.17,0:00:52.48,Default,,0000,0000,0000,,Wikipedia: is there anyone who doesn't know about Wikipedia? Dialogue: 0,0:00:52.48,0:00:57.17,Default,,0000,0000,0000,,So, a bit of Wikipedia history Dialogue: 0,0:00:57.17,0:01:04.58,Default,,0000,0000,0000,,and mostly about the technical aspect of multilingual support. Dialogue: 0,0:01:04.58,0:01:14.42,Default,,0000,0000,0000,,Originally when the Wikipedia was founded, it was only in English. Dialogue: 0,0:01:14.42,0:01:22.04,Default,,0000,0000,0000,,Now something which is nice and easy about English is that it doesn't have any accented characters. Dialogue: 0,0:01:22.04,0:01:30.42,Default,,0000,0000,0000,,But of course, an important problem with that is that the American programmers, like myself, Dialogue: 0,0:01:30.42,0:01:38.04,Default,,0000,0000,0000,,don't know much about problems concerning letters and writing systems in other languages, Dialogue: 0,0:01:38.04,0:01:48.04,Default,,0000,0000,0000,,and because of that, a lot of software and websites don't handle languages well, Dialogue: 0,0:01:48.04,0:01:54.25,Default,,0000,0000,0000,,except those from Western Europe. Dialogue: 0,0:01:54.25,0:01:59.25,Default,,0000,0000,0000,,Many are interested in supporting other languages, Dialogue: 0,0:01:59.25,0:02:07.52,Default,,0000,0000,0000,,so we can take all human knowledge to everyone on earth. Dialogue: 0,0:02:07.52,0:02:16.38,Default,,0000,0000,0000,,So, it would be good to support other languages, but that didn't work well at first. Dialogue: 0,0:02:16.38,0:02:22.33,Default,,0000,0000,0000,,I set up many websites for Wikipedia in many languages, Dialogue: 0,0:02:22.33,0:02:25.17,Default,,0000,0000,0000,,several dozen languages, in fact. Dialogue: 0,0:02:25.17,0:02:33.42,Default,,0000,0000,0000,,But many of them were totally messed up, for example the Japanese Wikipedia. Dialogue: 0,0:02:33.42,0:02:40.92,Default,,0000,0000,0000,,Now, it can be written well. It has many characters. Dialogue: 0,0:02:40.92,0:02:47.00,Default,,0000,0000,0000,,It looks good and one can read and write. Everything works well now. Dialogue: 0,0:02:47.00,0:02:51.88,Default,,0000,0000,0000,,But originally, it looked very similar to that. Dialogue: 0,0:02:51.88,0:03:01.83,Default,,0000,0000,0000,,It remained an important problem for many languages: Japanese, Chinese, Russian, Hebrew, etc. Dialogue: 0,0:03:01.83,0:03:05.88,Default,,0000,0000,0000,,Many of them didn't work at all. Dialogue: 0,0:03:05.88,0:03:19.57,Default,,0000,0000,0000,,At one time, Polish even set up its own website for its wiki, Dialogue: 0,0:03:19.57,0:03:28.04,Default,,0000,0000,0000,,which supported Eastern European letters well, but still not Japanese nor Russian, etc. Dialogue: 0,0:03:28.04,0:03:35.74,Default,,0000,0000,0000,,At that time, I started to get to know Wikipedia through Esperanto. Dialogue: 0,0:03:35.74,0:03:44.77,Default,,0000,0000,0000,,I was a university student and I learned French at a normal course. Dialogue: 0,0:03:44.77,0:03:55.04,Default,,0000,0000,0000,,But I became interested in other languages, and I taught myself Esperanto online and through books, etc. Dialogue: 0,0:03:55.04,0:04:00.21,Default,,0000,0000,0000,,On my computer and online, it was very interesting, Dialogue: 0,0:04:00.21,0:04:08.29,Default,,0000,0000,0000,,and the Esperanto Wikipedia was founded by our dear Chuck. Dialogue: 0,0:04:08.29,0:04:20.95,Default,,0000,0000,0000,,At that time, we still had a messed up character set, well for Esperanto, which has accented characters. Dialogue: 0,0:04:23.72,0:04:31.00,Default,,0000,0000,0000,,That way it has an accent on the letters. Dialogue: 0,0:04:31.00,0:04:40.27,Default,,0000,0000,0000,,But, for it to be written on the webpage, you had to write using the "x system". Dialogue: 0,0:04:40.27,0:04:50.48,Default,,0000,0000,0000,,So, "cx" changes to "ĉ", etc. It looks really ugly. Dialogue: 0,0:04:50.48,0:05:07.48,Default,,0000,0000,0000,,To make it more beautiful and show it the way it should be, I added Unicode support. Dialogue: 0,0:05:07.48,0:05:18.54,Default,,0000,0000,0000,,Unicode is a system to encode characters for every language in the world: Dialogue: 0,0:05:18.54,0:05:28.27,Default,,0000,0000,0000,,from Egyptian hieroglyphics to modern Japanese and Korean as well as many symbols Dialogue: 0,0:05:28.27,0:05:35.38,Default,,0000,0000,0000,,in one system, which can include all of them. Dialogue: 0,0:05:35.38,0:05:44.79,Default,,0000,0000,0000,,So that, we don't need a separate Polish system for Eastern Europe, Dialogue: 0,0:05:44.79,0:05:48.67,Default,,0000,0000,0000,,and French for Western Europe, etc. Dialogue: 0,0:05:48.67,0:05:58.19,Default,,0000,0000,0000,,We can have just one system, one program, one website for every language. Dialogue: 0,0:05:58.19,0:06:12.88,Default,,0000,0000,0000,,With that worldwide system, it was started in computers already 20 years ago, Dialogue: 0,0:06:12.88,0:06:23.42,Default,,0000,0000,0000,,but in 2001 or 2002, when we founded Wikipedia, Unicode was still "new" online, Dialogue: 0,0:06:23.42,0:06:30.33,Default,,0000,0000,0000,,so it was difficult to use it in "American" programs. Dialogue: 0,0:06:30.33,0:06:42.21,Default,,0000,0000,0000,,You had to kind of study how UTF8 works to put Unicode in a web page. Dialogue: 0,0:06:42.21,0:06:52.85,Default,,0000,0000,0000,,But, I was able to study it a bit, and I added support to Wikipedia's original software. Dialogue: 0,0:06:52.85,0:07:04.21,Default,,0000,0000,0000,,I gave it a converter from the "sx" to the correct "ŝ", etc. Dialogue: 0,0:07:04.21,0:07:10.17,Default,,0000,0000,0000,,But I found that it's not just for Esperanto. Dialogue: 0,0:07:10.17,0:07:14.12,Default,,0000,0000,0000,,It can work for other languages as well. Dialogue: 0,0:07:14.12,0:07:21.50,Default,,0000,0000,0000,,For example Russian, Japanese and Polish can work with Unicode. Dialogue: 0,0:07:21.50,0:07:27.75,Default,,0000,0000,0000,,Unfortunately, it was a bit more complicated, Dialogue: 0,0:07:27.75,0:07:40.92,Default,,0000,0000,0000,,because at that time we also upgraded to new Wikipedia software, which was completely different from the original. Dialogue: 0,0:07:40.92,0:07:47.79,Default,,0000,0000,0000,,It was better, but it still didn't support Unicode. Dialogue: 0,0:07:47.79,0:07:56.82,Default,,0000,0000,0000,,Of course, it was created by Western Europeans and Americans, Dialogue: 0,0:07:56.82,0:08:04.58,Default,,0000,0000,0000,,and it didn't know there were other languages other than in Western Europe and North America Dialogue: 0,0:08:04.58,0:08:08.06,Default,,0000,0000,0000,,which have other letters. Dialogue: 0,0:08:08.06,0:08:16.88,Default,,0000,0000,0000,,So, that's why it was necessary to add Unicode support three times. Dialogue: 0,0:08:16.88,0:08:20.81,Default,,0000,0000,0000,,Originally for the Esperanto Wikipedia. Dialogue: 0,0:08:20.81,0:08:33.96,Default,,0000,0000,0000,,The second time for the new system, which was originally created for the English Wikipedia and didn't need Unicode. Dialogue: 0,0:08:33.96,0:08:43.83,Default,,0000,0000,0000,,And again when we completely changed the software to speed it up, Dialogue: 0,0:08:43.83,0:08:49.00,Default,,0000,0000,0000,,but then it was completed the third time. Dialogue: 0,0:08:49.00,0:08:59.33,Default,,0000,0000,0000,,In 2002 and 2003, we tried to start new Wikipedias in many languages. Dialogue: 0,0:08:59.33,0:09:12.58,Default,,0000,0000,0000,,We reacquired Polish and were able to better unite it with the other languages. Dialogue: 0,0:09:12.58,0:09:21.67,Default,,0000,0000,0000,,For example, one language can link to a page about the same thing in another language. Dialogue: 0,0:09:21.67,0:09:29.46,Default,,0000,0000,0000,,Now with the same system for everything, one can do that. Dialogue: 0,0:09:29.46,0:09:38.83,Default,,0000,0000,0000,,It's better to combine the groups in their own language. Dialogue: 0,0:09:38.83,0:09:45.75,Default,,0000,0000,0000,,Similarly, there were other problems for the languages in the program online. Dialogue: 0,0:09:45.75,0:09:53.67,Default,,0000,0000,0000,,It was somewhat problematic, that the traditional American programmers Dialogue: 0,0:09:53.67,0:10:06.16,Default,,0000,0000,0000,,and often even the Western Europeans created their own programs only in English. Dialogue: 0,0:10:06.16,0:10:15.75,Default,,0000,0000,0000,,It was a problem when someone didn't know English or didn't know it well Dialogue: 0,0:10:15.75,0:10:20.96,Default,,0000,0000,0000,,or just wanted to use a system in their own language. Dialogue: 0,0:10:20.96,0:10:31.75,Default,,0000,0000,0000,,Because of that, we also had to add a system to translate messages from the websites, Dialogue: 0,0:10:31.75,0:10:37.96,Default,,0000,0000,0000,,so everyone can understand it in their own language. Dialogue: 0,0:10:37.96,0:10:52.42,Default,,0000,0000,0000,,For example, we can see ... Article, Discussion, History, Delete Dialogue: 0,0:10:52.42,0:10:57.04,Default,,0000,0000,0000,,"Article", "Talk", "Edit", "History", only in English. Dialogue: 0,0:10:57.04,0:11:01.46,Default,,0000,0000,0000,,It's not very good, though generally one understands English. Dialogue: 0,0:11:01.46,0:11:23.38,Default,,0000,0000,0000,,So, we created a map between the messages and a short description about every message. Dialogue: 0,0:11:23.38,0:11:41.38,Default,,0000,0000,0000,,When we have something larger, long messages, and there are sentences and paragraphs, etc. Dialogue: 0,0:11:41.38,0:11:46.14,Default,,0000,0000,0000,,It's a bit more complicated than simple words. Dialogue: 0,0:11:46.14,0:11:54.38,Default,,0000,0000,0000,,That's why we give a name for every message. Dialogue: 0,0:11:54.38,0:12:10.38,Default,,0000,0000,0000,,In the program, it doesn't have an English sentence, it just has a name which is "login-message" or the like. Dialogue: 0,0:12:10.38,0:12:21.55,Default,,0000,0000,0000,,In a file with the map for each individual language, is the name and the message. Dialogue: 0,0:12:21.55,0:12:25.73,Default,,0000,0000,0000,,The message can be translated into every language. Dialogue: 0,0:12:25.73,0:12:36.00,Default,,0000,0000,0000,,Similar systems are used in many programs of various kinds, Dialogue: 0,0:12:36.00,0:12:50.08,Default,,0000,0000,0000,,but what is most different about the Wikipedia system, is that one can also {\i1}change{\i0} that message. Dialogue: 0,0:12:50.08,0:13:13.56,Default,,0000,0000,0000,,If I want to change that sentence a bit, so that my Wikipedia can have a standard or rule Dialogue: 0,0:13:13.56,0:13:24.00,Default,,0000,0000,0000,,how one writes an article or choose administrators, etc. Dialogue: 0,0:13:24.00,0:13:30.63,Default,,0000,0000,0000,,It can be different in the Wikipedia system. Dialogue: 0,0:13:30.63,0:13:36.34,Default,,0000,0000,0000,,One can ... I'm not logged in, so I can't ... Dialogue: 0,0:13:36.34,0:13:48.58,Default,,0000,0000,0000,,but the website administrators can use the wiki to change their own messages. Dialogue: 0,0:13:48.58,0:13:54.38,Default,,0000,0000,0000,,[Unfortunately then, my camera stopped working.]