block by joyrexus 524d2ab465e36e44d29e

Transcript validation notes

Resources for creating a simple web interface for validating tabular transcript data.

I still see a lot of transcription and coding done in excel, with metadata stored in an info worksheet, transcription and coding in another transcript worksheet. Each row is tied to a timestamped utterance or speech act. So, there are columns for the timestamp and utterance along with additional columns for whatever annotation is needed (e.g., coding for syntax, semantics, gesture, context, etc.)

Anyway, we want to avoid parsing excel files. Let Google Drive / Sheets do this for us.

With a published sheet, we can retrieve a JSON feed from the Spreadsheets Data API.

However, we may want to avoid publishing our transcripts. Alternatively, we can use a node client with authentication built-in to access the relevant worksheet data for validation.

So, what we want is a simple single-page web app that …

File uploads are straightforward with the File API.

Keep it simple!