Microsoft is broadening the range of supported file formats in SharePoint Premium Document Translation (formerly also known as Microsoft Syntex). Alongside this enhancement, the feature has been rebranded to SharePoint Content AI Document Translation.
This update brings it closer in alignment with Azure Document Translation, enabling new use cases for translating technical documents, web content, and structured data.
Timeline
The rollout is expected to be completed in July 2025.
How does this affect your organization?
Users can now translate a broader range of file formats in SharePoint and OneDrive.
Previously, SharePoint Content AI Document Translation (formerly known as SharePoint Premium Document Translation) supported the following formats:
.csv, .docx, .htm, .html, .markdown, .md, .msg, .pdf, .pptx, .txt, and .xlsx.
With the latest update, the feature now also supports these additional file formats:
- Markdown files: .mdown, .mdtext, .mdtxt, .mdwn, .mkd, .mkdn
- Web archive formats: .mht, .mhtml
- Open formats: .odt (OpenDocument Text), .rmd (R Markdown)
- Structured data: .tab, .tsv
- Localization formats: .xlf (XLIFF)
The expanded file format support enables users to:
- Translate developer documentation and technical content written in Markdown
- Support translation of content exported from open-source tools such as LibreOffice
- Localize eLearning and software files using XLIFF (.xlf)
- Translate structured data files (.tsv, .tab) without file conversion
This update reduces manual effort and expands translation capabilities for file formats commonly used across organizations.
Keep in mind that SharePoint Content AI Document Translation supports files up to 40 MB in size.
Here is an example of a new file format, mhtml (Web archive format).
ChatGPT prepared this sample in English.

SharePoint Content AI Document Translation translated the file into German in less than 10 seconds.

As a reminder, all SharePoint Premium features are billed through Microsoft Syntex Pay-as-you-go.
Document translation > $15.00/1M characters
The number of characters processed. Character count includes letters, Unicode code points, punctuation, and white spaces.