📚 The CoCalc Library - books, templates and other resources
cocalc-examples / martinthoma-latex-examples / documents / bachelor-proposal / bachelor-proposal.tex
132928 viewsLicense: OTHER
\documentclass[a4paper]{scrartcl}1\usepackage{amssymb, amsmath} % needed for math2\usepackage[utf8]{inputenc} % this is needed for umlauts3\usepackage[english]{babel} % this is needed for umlauts4\usepackage[T1]{fontenc} % this is needed for correct output of umlauts in pdf5\usepackage[margin=2.5cm]{geometry} %layout6\usepackage{hyperref} % links im text7\usepackage{color}8\usepackage{framed}9\usepackage{enumerate} % for advanced numbering of lists10\usepackage{csquotes}11\usepackage{ifxetex,ifluatex}12\usepackage{etoolbox}13\usepackage[svgnames]{xcolor}14\usepackage{tikz}15\usepackage{framed}16\usepackage{parskip}17\usepackage{cite}18\usepackage{mystyle}19\clubpenalty = 10000 % Schusterjungen verhindern20\widowpenalty = 10000 % Hurenkinder verhindern2122\hypersetup{23pdfauthor = {Martin Thoma},24pdfkeywords = {Bachelor proposal: },25pdftitle = {Bachelor proposal}26}2728%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%2930\begin{document}31\title{Proposal for a Bachelor of Science Thesis:\\Recognition of mathematical formulae in the Context of Lecture Translation}32\author{Martin Thoma}33\maketitle34\section{The problem backgound}35The KIT Lecture Translator, CMUSphinx, Android voice typing and36many other speech recognition systems have proven that it is possible to37recognize speech. But at the moment, there seems not to be a single38system that manages to recognize natural language math speech39recognition. For example, a term like40\[\sum_{n=1}^\infty \frac{1}{n^2} \rightarrow \infty \]41would naturally be spoken as4243\begin{shadequote}[l]{}44The sum of one divided by n squared for n from one to infinity diverges to infinity.45\end{shadequote}4647in natural language. Today, speech recognition systems do only48recognize the words spoken. They don't recognize that it was a49mathematical term which could and should be expressed with symbols.5051One way to extend an existing speech recognition $A$ systems would be52by the following steps:53\begin{enumerate}54\item $A$ recognizes speech and returns a text $T$. This text55has to contain anotations that indicate at which time56in the original recording the various parts of speech57were detected.58\item A math detecter parses $T$ and returns the time intervalls $I$59when math was detected.60\item A math parser tries to parse speech in $I$. This parser61can make use of a language model dedicated to math. It62returns weighted hypotheses which terms might have63been spoken.64\item Finally, a program compares the hypotheses with math65in a formula database. Many formulas might already been66written in \TeX{}, e.g. on Wikipedia, math.stackexchange.com67or in freely available \LaTeX{} / \TeX{} files.68\end{enumerate}69\break7071\section{The problem statement}72The bachelor's thesis at KIT is worth 15 ECTS. It should be73created within 4 months and at most 450 hours.7475This aim of this bachelor's thesis is to answer the following76questions:77\begin{itemize}78\item \textbf{Representation of Math:} How can math be expressed79for speech recognition in a textual way?80Especially:81\begin{itemize}82\item What reasons are there to use \TeX{}, which83reasons are there for MathML?84\item Are there alternatives?85\end{itemize}86\item \textbf{Detection:} How can parts of speech be detected87that contain math?88\begin{itemize}89\item Which keywords indicate mathematics?90\item Is a keyword-density based approach sufficient?91\end{itemize}92\item \textbf{Evalution of math recognition strength}:93\begin{itemize}94\item How can speech recognition systems be evaluated95for their strength in math recognition?96\item Is the \textbf{W}ord \textbf{E}rror \textbf{R}ate97to measure how well the recognition worked?98\end{itemize}99\item \textbf{Literature research:}100\begin{itemize}101\item Can \TeX{} be used as a grammar to recognize math speech?102\item Can MathML be used as a grammar to recognize math speech?103\end{itemize}104\end{itemize}105106Follow-up tasks, that will not be part of this bachelor's thesis,107include:108\begin{itemize}109\item \textbf{Other languages}: This thesis will focus on math110recognition for the English language. Follow-up work might111try to deal with math independant of the language.112\item \textbf{Implementation}: The aim of this thesis is not113to create a working math recognition.114\end{itemize}115116\section{Significance}117This thesis will create a basis for follow-up work in speech recognition118that contains mathematical content. It will enable people to evaluate119various speech2math recognition ideas. Also, it will give an overview120of the current state of art in math speech recognition and which121questions need to be tackled in feature.122123\section{Time schedule}124\begin{itemize}125\item[10h] Research of ways to represent math126\item[20h] Research ways how \TeX{} deals with math127\item[20h] Research how MathML deals with math128\item[50h] Recording math lectures129\item[100h] Annotating math lectures; writing the best130representation for mathematical terms contained in131these lectures132\item[10h] Finding keywords that indicate mathematical formulas133\item[5h] Test the keyword-approach with the annotated lectures134\end{itemize}135136\renewcommand\refname{Related Literature}137\nocite{*}138\bibliographystyle{itmalpha}139\bibliography{literatur}140141\section{Hypotheses}142I think that MathML will be the best way to represent math, because143it was designed to do this. MathML~3.0, the most recent version,144is a W3C recommendation since October 2001.145146\TeX{} in contrast is great in rendering mathematical equations,147but it grew over time. It existed even before the web was invented.148149Another reason why I think MathML might be favorable for internal150representation is that it was created to be parsed and written by151machines. It is an XML standard and as such you can apply XML tools152and libraries to parse it. \TeX{} on the other hand was created153to be written by humans.154155I'm pretty sure that it is hopless to create a grammar for math156in it's general form. But for some areas like boolean logic, arithmetic157or analysis it might work pretty well.158159\end{document}160161162