Document Scan

The Docutain Document Scanner SDK for Android comes with integrated, ready-to-use UI components for the document scan process. Colors and icons can be changed to match your branding.

It is also possible to use the Document Scanner on imported images.


Initialization

Make sure you have defined the following package dependencies in your app's build.gradle file:

def docutainSdkVersion = '1.7.0.3'

dependencies {
    // For Document Scanner components
    implementation("de.docutain:Docutain-SDK-UI:$docutainSdkVersion")
}
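
If your project uses the Gradle Kotlin DSL (build.gradle.kts) instead of Groovy, the same dependency could be declared roughly as follows. This is a sketch that only translates the snippet above; the artifact coordinates and version are taken from it unchanged.

```kotlin
// build.gradle.kts (module level), Kotlin DSL variant of the Groovy snippet
val docutainSdkVersion = "1.7.0.3"

dependencies {
    // Document Scanner UI components
    implementation("de.docutain:Docutain-SDK-UI:$docutainSdkVersion")
}
```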

Initialize the Docutain Android Scanner SDK as described here.

Scan with Camera

Camera Permission

Declare permissions to use the camera in your AndroidManifest.xml:

<uses-permission android:name="android.permission.CAMERA"/>
<uses-feature android:name="android.hardware.camera"/>
Info: Runtime permission for the camera is handled automatically by the Docutain SDK.

Start Camera Scan

To start the scan process you only have to launch an ActivityResultLauncher with our predefined ScanResult contract and wait for it to return.

An instance of DocumentScannerConfiguration is required to launch the document scanner. It lets you adapt parts of the scanner's behaviour to your needs. See Change default scan behaviour for the available settings.

import de.docutain.sdk.ui.ScanResult

val documentScanResult = registerForActivityResult(ScanResult()) { result ->
    if (result) {
        // user finished scan process, continue with your workflow
        // generate PDF by using Document.writePDF()
        // get detected text by using DocumentDataReader.getText()
        // get data by using DocumentDataReader.analyze()
    } else {
        // user canceled scan process
    }
}

myButton.setOnClickListener {
    val scanConfig = DocumentScannerConfiguration()
    documentScanResult.launch(scanConfig)
}
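
For orientation, the snippet above could sit in an activity roughly like this. It is a minimal sketch, not the SDK's reference integration: ScanActivity, R.layout.activity_scan, R.id.scan_button and the two callback methods are hypothetical names, R is your app's generated resource class, and the package of DocumentScannerConfiguration is assumed to be the same as that of ScanResult.

```kotlin
import android.os.Bundle
import android.widget.Button
import androidx.appcompat.app.AppCompatActivity
import de.docutain.sdk.ui.DocumentScannerConfiguration // assumed: same package as ScanResult
import de.docutain.sdk.ui.ScanResult

// Hypothetical host activity; layout and view id are placeholders.
class ScanActivity : AppCompatActivity() {

    // Registered as a property so the launcher exists before the activity is started.
    private val documentScanResult = registerForActivityResult(ScanResult()) { finished ->
        if (finished) onScanFinished() else onScanCanceled()
    }

    override fun onCreate(savedInstanceState: Bundle?) {
        super.onCreate(savedInstanceState)
        setContentView(R.layout.activity_scan)

        findViewById<Button>(R.id.scan_button).setOnClickListener {
            // Default configuration: scan with the camera.
            documentScanResult.launch(DocumentScannerConfiguration())
        }
    }

    private fun onScanFinished() {
        // Continue with your workflow, e.g. PDF creation or text detection.
    }

    private fun onScanCanceled() {
        // The user left the scanner without finishing.
    }
}
```

Registering the launcher as a property rather than inside the click listener matters because Activity Result launchers have to be registered before the activity reaches the started state.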

Scan from imported images

It is also possible to use the Document Scanner on existing images, for example images selected from the user's photo gallery. Starting the scanner on imported images works the same way as scanning with the camera. The only difference is that you define a different source in the DocumentScannerConfiguration. Possible values are:

  • CAMERA : This is the default value. Starts the Document Scanner using the device's camera.
  • IMAGE : Starts the Document Scanner on images provided by you via code. Set sourceImages on the DocumentScannerConfiguration to the images to be scanned.
  • GALLERY : Opens the user's photo gallery in single select mode. The Document Scanner is run on the selected image.
  • GALLERY_MULTIPLE : Opens the user's photo gallery in multi select mode. The Document Scanner is run on the selected images.

The following sample shows how to open the photo gallery in multi select mode and run the Document Scanner on the selected images:

import de.docutain.sdk.ui.ScanResult

val documentScanResult = registerForActivityResult(ScanResult()) { result ->
    if (result) {
        // user finished scan process, continue with your workflow
        // generate PDF by using Document.writePDF()
        // get detected text by using DocumentDataReader.getText()
        // get data by using DocumentDataReader.analyze()
    } else {
        // user canceled scan process
    }
}

myButton.setOnClickListener {
    val scanConfig = DocumentScannerConfiguration()
    scanConfig.source = Source.GALLERY_MULTIPLE
    documentScanResult.launch(scanConfig)
}
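
The IMAGE source can be sketched in the same way. The helper below is hypothetical (imageScanConfig is not part of the SDK), and it assumes that sourceImages is a settable List<java.io.File>, as listed in the DocumentScannerConfiguration table, and that DocumentScannerConfiguration and Source live in the same package as ScanResult.

```kotlin
import java.io.File
import de.docutain.sdk.ui.DocumentScannerConfiguration // assumed: same package as ScanResult
import de.docutain.sdk.ui.Source                       // assumed: same package as ScanResult

// Hypothetical helper: builds a configuration that runs the scanner
// on image files your app already has.
fun imageScanConfig(imageFiles: List<File>): DocumentScannerConfiguration {
    val scanConfig = DocumentScannerConfiguration()
    scanConfig.source = Source.IMAGE
    // Assumption: sourceImages can be assigned a List<File>.
    scanConfig.sourceImages = imageFiles
    return scanConfig
}
```

It would then be launched like any other configuration, for example with documentScanResult.launch(imageScanConfig(listOf(File(filesDir, "invoice.jpg")))), where the file name is only a placeholder.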

Change default scan behaviour

DocumentScannerConfiguration

You can use the DocumentScannerConfiguration to alter the default scan behaviour to your needs. Currently the following values can be set:

| Property | Type | Default Value | Description |
| --- | --- | --- | --- |
| allowCaptureModeSetting | Boolean | false | If true, the document scanner toolbar will display an item that allows the user to switch between automatic and manual camera triggering. |
| autoCapture | Boolean | true | If true, the camera will capture the image automatically at the right moment. |
| defaultScanFilter | ScanFilter | ILLUSTRATION | The default scan filter that will be used after scan. |
| onboardingImageSource | Int | R.drawable.hint_edge_detection | Your custom image for the onboarding dialog that appears when scan is opened for the first time. |
| pageEditConfig | PageEditConfiguration | PageEditConfiguration | Configuration class used to alter the default page editing behaviour. See PageEditConfiguration for more details. |
| source | Source | CAMERA | The source of the Document Scanner. If you need to run the scanner on imported images please see Scan from imported images for more details. |
| sourceImages | List<java.io.File> | empty list | Please see Scan from imported images for more details. |
| autoCrop | Boolean | true | If true, image gets automatically cropped if document was detected. This applies only when importing images. |
| multiPage | Boolean | true | If true, scanning multi page documents is possible. Set this to false if you need to scan single page documents. |
| preCaptureFocus | Boolean | true | If true, the camera will run a focus action right before taking the image. This improves the quality of the scanned images, but depending on the device, image capture might take a little bit longer. |
| theme | Int | R.style.DocutainSDK_Theme_Default | Your custom theme if you like to change colors to match your branding. Please see ColorConfiguration for more details. |
| textConfig | TextConfiguration | TextConfiguration | Configuration class used to alter the default text behaviour. See TextConfiguration for more details. |
| buttonConfig | ButtonConfiguration | ButtonConfiguration | Configuration class used to alter the default buttons. See ButtonConfiguration for more details. |
| confirmPages | Boolean | false | If true, a list of all pages (thumbnails) will be displayed before the scan process can be finished. |
| allowPageEditing | Boolean | true | If true, after the scan screen is finished, an editing screen with the captured images will be displayed. On the editing screen, the user can crop manually, rotate the page, filter the page and much more. The editing screen can be configured by accessing the PageEditConfiguration. |

Info: All parameters in DocumentScannerConfiguration are optional.


The following sample shows how to activate the confirmation mode:

import de.docutain.sdk.ui.ScanResult

val documentScanResult = registerForActivityResult(ScanResult()) { result -> }

...

myButton.setOnClickListener {
    val scanConfig = DocumentScannerConfiguration()
    scanConfig.confirmPages = true
    documentScanResult.launch(scanConfig)
}
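
Several settings can be combined on the same configuration object. The following sketch builds a configuration for single page documents with manual capture; singlePageManualConfig is a hypothetical helper name, the property names come from the table above, and the package of DocumentScannerConfiguration is assumed to be the same as that of ScanResult.

```kotlin
import de.docutain.sdk.ui.DocumentScannerConfiguration // assumed: same package as ScanResult

// Hypothetical helper: single page scan with manual capture.
fun singlePageManualConfig(): DocumentScannerConfiguration {
    val scanConfig = DocumentScannerConfiguration()
    scanConfig.multiPage = false              // single page documents only
    scanConfig.autoCapture = false            // the user triggers the capture
    scanConfig.allowCaptureModeSetting = true // toolbar item to switch back to automatic capture
    scanConfig.confirmPages = true            // show page thumbnails before finishing
    return scanConfig
}
```

Properties that are not set keep the defaults listed in the table.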

PageEditConfiguration

You can use the PageEditConfiguration to alter the default page editing behaviour of the document scanner to your needs. Currently the following values can be set:

| Property | Type | Default Value | Description |
| --- | --- | --- | --- |
| allowPageFilter | Boolean | true | If false, the bottom toolbar will hide the filter page item. |
| allowPageRotation | Boolean | true | If false, the bottom toolbar will hide the rotate page item. |
| allowPageArrangement | Boolean | true | If false, the bottom toolbar will hide the arrange page item. |
| allowPageCropping | Boolean | true | If false, the bottom toolbar will hide the page cropping item. |
| allowPageRetake | Boolean | false | If true, the bottom toolbar will show a button that lets the user retake the current page. |
| pageArrangementShowDeleteButton | Boolean | false | If true, each item of the page arrangement functionality will show a delete button. |
| pageArrangementShowPageNumber | Boolean | true | If true, each item of the page arrangement functionality will show its page number. |

Info: All parameters in PageEditConfiguration are optional.


The following sample shows how to activate the page retake button:

import de.docutain.sdk.ui.ScanResult

val documentScanResult = registerForActivityResult(ScanResult()) { result -> }

...

myButton.setOnClickListener {
    val scanConfig = DocumentScannerConfiguration()
    scanConfig.pageEditConfig.allowPageRetake = true
    documentScanResult.launch(scanConfig)
}
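
As a further illustration, the editing toolbar can be trimmed by changing several PageEditConfiguration properties at once. This is a sketch under the assumption that all properties in the table are mutable, like allowPageRetake above; restrictedEditConfig is a hypothetical helper name and the import package is assumed.

```kotlin
import de.docutain.sdk.ui.DocumentScannerConfiguration // assumed: same package as ScanResult

// Hypothetical helper: hides filter and rotation, and enables per-page
// deletion on the page arrangement screen.
fun restrictedEditConfig(): DocumentScannerConfiguration {
    val scanConfig = DocumentScannerConfiguration()
    scanConfig.pageEditConfig.apply {
        allowPageFilter = false                // hide the filter item
        allowPageRotation = false              // hide the rotate item
        pageArrangementShowDeleteButton = true // show a delete button per page
    }
    return scanConfig
}
```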

TextConfiguration

You can use the TextConfiguration to alter the default text behaviour of the document scanner to your needs. If a value does not get set explicitly, the default value provided by the SDK will be used. If you set a text value to null, it won't show any text. You can use this to remove predefined text. Currently the following values can be set:

| Property | Type | Description |
| --- | --- | --- |
| textSizeBottomToolbar | Float? | The text size of elements residing in the bottom toolbar. |
| textSizeTopToolbar | Float? | The text size of menu items residing in the top toolbar. |
| textSizeScanButtons | Float? | The text size of the buttons in the scan page, located at the lower part, like the torch button. |
| textSizeTitle | Float? | The text size of the title in the top toolbar. By default, automatic shrinking down to 9.0 is enabled. If you define a custom size, automatic shrinking will be disabled. |
| textTitleScanPage | String? | The title to be displayed in the scan page top toolbar. |
| textTitleEditPage | String? | The title to be displayed in the edit page top toolbar. |
| textTitleFilterPage | String? | The title to be displayed in the filter page top toolbar. |
| textTitleCroppingPage | String? | The title to be displayed in the cropping page top toolbar. |
| textTitleArrangementPage | String? | The title to be displayed in the page arrangement page top toolbar. |
| textTitleConfirmationPage | String? | The title to be displayed in the confirmation page top toolbar. |
| textDocumentTitle | String? | The title to show in the top toolbar on all pages. It overrides page specific titles, if any are set. |
| textOnboardingTitle | String? | The text to show in the onboarding popup title that appears when the scan page is opened for the first time. |
| textOnboardingMessage | String? | The text to show in the onboarding popup message that appears when the scan page is opened for the first time. |
| textOnboardingCloseButton | String? | The text to show in the onboarding popup close button that appears when the scan page is opened for the first time. |
| textSizeOnboardingTitle | Float? | The text size of the onboarding popup title that appears when the scan page is opened for the first time. |
| textSizeOnboardingMessage | Float? | The text size of the onboarding popup message that appears when the scan page is opened for the first time. |
| textFocusHint | String? | The text to show when the camera is focusing after a capture was triggered. |
| textFirstPageHint | String? | The text to show when the user swipes to the previous page but is already at the first page. |
| textLastPageHint | String? | The text to show when the user swipes to the next page but is already at the last page. |
| textOnePageHint | String? | The text to show when the user swipes to the next or previous page but only one page is available. |
| textScanProgress | String? | The text to show in the progress popup that is shown when the user clicks the done button but some pages still need to be processed. |
| textDeleteDialogCurrentPage | String | The text to show for the option to delete the current page within the dialog that will be shown when clicking delete while multiple pages are available. |
| textDeleteDialogAllPages | String | The text to show for the option to delete all pages within the dialog that will be shown when clicking delete while multiple pages are available. |
| textDeleteDialogCancel | String? | The text to show for the option to cancel within the dialog that will be shown when clicking delete while multiple pages are available. |

Info: All parameters in TextConfiguration are optional.


The following sample shows how to set a document title:

import de.docutain.sdk.ui.ScanResult

val documentScanResult = registerForActivityResult(ScanResult()) { result -> }

...

myButton.setOnClickListener {
    val scanConfig = DocumentScannerConfiguration()
    scanConfig.textConfig.textDocumentTitle = "Custom Document Title"
    documentScanResult.launch(scanConfig)
}
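
The null behaviour described above can be sketched as follows: page specific titles are replaced, the predefined focus hint is removed by setting it to null, and a fixed title size is set. customTextConfig and the example strings are hypothetical, the import package is assumed, and the unit of the text size values is not stated here, so treat 18f as a placeholder.

```kotlin
import de.docutain.sdk.ui.DocumentScannerConfiguration // assumed: same package as ScanResult

// Hypothetical helper: custom titles, no focus hint, fixed title size.
fun customTextConfig(): DocumentScannerConfiguration {
    val scanConfig = DocumentScannerConfiguration()
    scanConfig.textConfig.apply {
        textTitleScanPage = "Scan receipt"  // placeholder text
        textTitleEditPage = "Review pages"  // placeholder text
        textFocusHint = null                // null removes the predefined text
        textSizeTitle = 18f                 // custom size; disables automatic shrinking
    }
    return scanConfig
}
```

textDocumentTitle is deliberately left unset here, since it would take precedence over the page specific titles.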

ButtonConfiguration

You can use the ButtonConfiguration to alter the default buttons of the scanner. Each button is a DocutainButton object with a title and an icon property. Buttons residing in the top toolbar can have either a title or an icon. If you define both, the icon will be displayed. Buttons residing in the bottom toolbar can have both title and icon at the same time. If you want a button to display only text, set the icon property to null and the title property to the text you want to display. If you want a button to display only an icon, set the title property to null and the icon property to the icon you want to display.

Currently the following buttons can be set:

| Button | Default Value | Description |
| --- | --- | --- |
| buttonEditRotate | [image: buttonEditRotate] | The button that rotates the current page. |
| buttonEditCrop | [image: buttonEditCrop] | The button that opens the cropping functionality. |
| buttonEditFilter | [image: buttonEditFilter] | The button that opens the filter functionality. |
| buttonEditArrange | [image: buttonEditArrange] | The button that opens the page arrangement functionality. |
| buttonEditRetake | [image: buttonEditRetake] | The button that starts the process of replacing the current page with a new scan. |
| buttonEditDelete | [image: buttonEditDelete] | The button that deletes the current page or opens a dialog with options if multiple pages are available. |
| buttonEditFinish | [image: buttonEditFinish] | The button that finishes the scan process. |
| buttonCropExpand | [image: buttonCropExpand] | The button within the cropping functionality that expands the current cropping rectangle to the whole page. |
| buttonCropSnap | [image: buttonCropSnap] | The button within the cropping functionality that snaps the current cropping rectangle to the detected document. |
| buttonCropFinish | [image: buttonCropFinish] | The button within the cropping functionality that finishes the manual cropping process according to the current cropping rectangle. |
| buttonScanAutoCaptureOn | [image: buttonScanAutoCaptureOn] | The button within the scan functionality that is shown when automatic capture is activated. |
| buttonScanAutoCaptureOff | [image: buttonScanAutoCaptureOff] | The button within the scan functionality that is shown when automatic capture is deactivated. |
| buttonScanTorch | [image: buttonScanTorch] | The button within the scan functionality that toggles the torch. |
| buttonScanCapture | [image: buttonScanCapture] | The button within the scan functionality that triggers a manual image capture. |
| buttonScanFinish | [image: buttonScanFinish] | The button within the scan functionality that finishes the current scan process and leads to the editing page. |
| buttonConfirmationFinish | [image: buttonConfirmationFinish] | The button on the confirmation page that finishes the scan process. |

Info: All parameters in ButtonConfiguration are optional.


The following sample shows how to customize the buttonEditRotate:

import de.docutain.sdk.ui.ScanResult

val documentScanResult = registerForActivityResult(ScanResult()) { result -> }

...

myButton.setOnClickListener {
    val scanConfig = DocumentScannerConfiguration()
    scanConfig.buttonConfig.buttonEditRotate.title = "Custom Title"
    scanConfig.buttonConfig.buttonEditRotate.icon = R.drawable.icon
    documentScanResult.launch(scanConfig)
}
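
The text-only and icon-only cases described above might look like this. It is a sketch: customButtonConfig, the title string and R.drawable.ic_filter are hypothetical, R is your app's generated resource class, the import package is assumed, and the code assumes that title and icon accept null as described in the introduction to this section.

```kotlin
import de.docutain.sdk.ui.DocumentScannerConfiguration // assumed: same package as ScanResult

// Hypothetical helper: one text-only and one icon-only bottom toolbar button.
fun customButtonConfig(): DocumentScannerConfiguration {
    val scanConfig = DocumentScannerConfiguration()

    // Text only: remove the icon, keep a title.
    scanConfig.buttonConfig.buttonEditCrop.icon = null
    scanConfig.buttonConfig.buttonEditCrop.title = "Crop"

    // Icon only: remove the title, set a custom icon.
    scanConfig.buttonConfig.buttonEditFilter.title = null
    scanConfig.buttonConfig.buttonEditFilter.icon = R.drawable.ic_filter // hypothetical drawable

    return scanConfig
}
```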

ColorConfiguration

In order to fit the Docutain Scanner SDK for Android into your corporate design, you have a bunch of options to alter the default color theming of the ready to use UI components.

Supported colors

The following is a list of all colors currently supported.

| Color | Default Value | Description |
| --- | --- | --- |
| docutain_colorPrimary | light: #4CAF50, dark: #4CAF50 | Used to tint progress indicators and dialog buttons. |
| docutain_colorSecondary | light: #4CAF50, dark: #4CAF50 | Used to tint selectable controls and the capture button. |
| docutain_colorOnSecondary | light: #FFFFFF, dark: #000000 | Used to tint elements that reside on docutain_colorSecondary, like the icon of the capture button. |
| docutain_colorScanButtonsLayoutBackground | light: #121212, dark: #121212 | Used to tint the background of the scan buttons layout. |
| docutain_colorScanButtonsForeground | light: #FFFFFF, dark: #FFFFFF | Used to tint the foreground of the scan buttons layout, like the torch button. |
| docutain_colorScanPolygon | light: #4CAF50, dark: #4CAF50 | Used to tint the polygon overlay which highlights the currently detected document. |
| docutain_colorBottomBarBackground | light: #FFFFFF, dark: #212121 | Used to tint the bottom toolbar background of the image editing page. |
| docutain_colorBottomBarForeground | light: #323232, dark: #BEBEBE | Used to tint the buttons within the bottom toolbar of the image editing page. |
| docutain_colorTopBarBackground | light: #4CAF50, dark: #2a2a2a | Used to tint the top toolbar background. |
| docutain_colorTopBarForeground | light: #FFFFFF, dark: #DEFFFFFF | Used to tint the buttons contained in the top toolbar. |
| docutain_colorTopBarTitle | light: #FFFFFF, dark: #DEFFFFFF | Used to tint the text of the top toolbar title. |

Dark Mode

The Docutain Scanner SDK for Android also supports dark mode theming. The process is the same: define different colors for the night version of your theme. The SDK decides which color to use depending on the device's system setting, and the colors change at runtime if the device's display mode changes.

Defining a custom theme

In order to alter at least one of the colors mentioned above, you need to define a custom theme which uses DocutainSDK.Theme.Default as parent.

The following is an example which alters all currently available colors.

<style name="DocutainSDKTestApp.DocutainSDKTheme" parent="DocutainSDK.Theme.Default">
    <item name="docutain_colorPrimary">@color/colorPrimary</item>
    <item name="docutain_colorSecondary">@color/colorSecondary</item>
    <item name="docutain_colorOnSecondary">@color/colorOnSecondary</item>
    <item name="docutain_colorScanButtonsLayoutBackground">@color/colorScanButtonsLayoutBackground</item>
    <item name="docutain_colorScanButtonsForeground">@color/colorScanButtonsForeground</item>
    <item name="docutain_colorScanPolygon">@color/colorScanPolygon</item>
    <item name="docutain_colorBottomBarBackground">@color/colorBottomBarBackground</item>
    <item name="docutain_colorBottomBarForeground">@color/colorBottomBarForeground</item>
    <item name="docutain_colorTopBarBackground">@color/colorTopBarBackground</item>
    <item name="docutain_colorTopBarForeground">@color/colorTopBarForeground</item>
    <item name="docutain_colorTopBarTitle">@color/colorTopBarTitle</item>
</style>

To tell the scanner to use your custom theme, set the theme attribute of the DocumentScannerConfiguration.

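import de.docutain.sdk.ui.ScanResult

val documentScanResult = registerForActivityResult(ScanResult()) { result -> }

...

myButton.setOnClickListener {
    val scanConfig = DocumentScannerConfiguration()
    // Use the resource id of your own style; dots in the style name become
    // underscores, e.g. R.style.DocutainSDKTestApp_DocutainSDKTheme for the style above.
    scanConfig.theme = R.style.your_custom_theme
    documentScanResult.launch(scanConfig)
}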

Result handling

After the scan process is successfully finished, you can do a bunch of things with the scanned pages:

  • Android PDF Creation
  • Android Text Detection
  • Android Data Extraction

Language Support

The device's locale determines the language used by the scanner.

The SDK provides default translations for certain languages. Refer to TextConfiguration if you want to set your own texts.

Currently available translations:

  • English
  • Arabic
  • Bulgarian
  • Chinese, Simplified
  • Chinese, Traditional
  • Croatian
  • Czech
  • Danish
  • Dutch
  • Finnish
  • French
  • German
  • Greek
  • Hindi
  • Hungarian
  • Icelandic
  • Indonesian
  • Italian
  • Japanese
  • Korean
  • Lithuanian
  • Norwegian Bokmål
  • Polish
  • Portuguese
  • Portuguese (Brazil)
  • Romanian
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish
  • Swedish
  • Turkish

The fallback language is English. If the device is set to a language that is not currently supported, the scanner shows English texts.

If you think the translation can be improved, please feel free to contact us via support.sdk@Docutain.com.