For Confluence on-premise collections, is it necessary to install anything on a server?
Yes, you must install an application on a Windows machine that is always on and has constant connectivity to the Confluence server and Internet. For more information, explore our guide to collecting from Confluence on-premise.
Can Onna collect Jira data from embedded Jira links within Confluence pages?
Even if the credentials are the same for both the Confluence and Jira accounts, Onna will not sync the data within the embedded Jira links. You must create a separate source within Onna to sync Jira data. Learn how to set up a Jira source in our Help Center.
Why is Admin access required for Onna to pull data from Confluence?
We request admin access for completeness in the collection. By authenticating with an admin user, we can ensure that all available spaces and pages are returned. A regular user may not have access to the space needed to collect or all of the pages in a space to collect.
Can a sync be configured only to collect a single Confluence page?
Yes, to sync a single Confluence page, paste the link of the single page in the hostname field. When configuring the source, you will receive a message asking if you want to sync a single page.
How do I collect from a Confluence Data Center?
Confluence Data Center is a self-managed edition of Confluence built for enterprises. Onna users can collect from the Data Center edition with the same features and limitations of a Confluence server.
Onna recommends setting up a collection through the Legacy API when ingesting a large volume of Confluence spaces or pages. With the Legacy API configuration, you can collect multiple single-page collections that may not be able to be set up through the user interface.
Tip: when ingesting a large volume of Confluence content, the Onna Support team suggests creating several Confluence data sources in Onna. Each source can be separated by Confluence spaces or pages.
How do I identify specific pages in exported Confluence data?
If you are trying to identify specific pages in exported Confluence data, there are a few fields you can leverage to find this information.
- Metadata_File name: This field includes the file name of the resource. The file name is based on the page's subject, and the format will be HTML by default, so this will help you identify pages by name.
- ResourceName: This field includes the Page ID. Page ID and Space ID are the unique identifier and representatives of each Space and Page. Users can edit the title of the Page anytime. Therefore, it's more content than a field. Please note, when reviewing ResourceName, results with pre-fix "att" are likely attachments of pages.
To identify the Space ID and a Page ID you can deconstruct the URL of the Page. In this example:
The Space ID is JT, and the Page ID is 2254897153