In HTML, you can declare a document's character encoding with a meta element placed in the head. Here is the declaration for a document written in HTML 4.01:
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
HTML5 introduced a shorter, much more memorable form:
<meta charset="UTF-8">
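As a minimal sketch of where the declaration goes, the skeleton below places the HTML5 form as the first child of the head element. The title and paragraph text are placeholders for illustration only. The HTML specification requires the encoding declaration to appear within the first 1024 bytes of the document, so putting it ahead of the title and any scripts is a safe habit.

```html
<!DOCTYPE html>
<html lang="en">
<head>
  <!-- Encoding declaration comes first, before any other text in the head -->
  <meta charset="UTF-8">
  <!-- Placeholder title, not part of the original example -->
  <title>Example page</title>
</head>
<body>
  <!-- Placeholder content containing a non-ASCII character -->
  <p>Non-ASCII text such as "café" displays correctly when the file is saved as UTF-8.</p>
</body>
</html>
```

Note that the declaration only tells the browser how to decode the bytes; the file itself still has to be saved in the encoding you declare.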