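Paper management: docdb and docarc
Both are PHP/Perl + PostgreSQL/MySQL + Apache applications, so I wonder whether porting them would be relatively simple. Would any experienced member be willing to give it a try? Or could someone point out what a newcomer should do (just read the Porter's Handbook???), what to watch out for, and how to avoid common mistakes? Many thanks.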
Document Database (PHP + MySQL/PostgreSQL): http://docdb.sourceforge.net/
Document Archive (Perl + MySQL): http://docarc.sourceforge.net/
Welcome to the Document Database Web Site
Overview
The Document Database grew out of the need to organize research papers and their BibTeX entries. The lab I'm working in used to have a master BibTeX file that over the years grew to contain more than 600 entries and is now about 200 kByte in size. We had problems with multiple people, more or less BibTeX savvy, editing this file: duplicate entries, keeping track of the version of the file, and so on. The Document Database was started to overcome these issues.
The Document Database is entirely written in PHP. It runs on Apache 1.3 and 2.0 webservers with integrated PHP and uses Postgres 7.3.x or 7.4.x or MySQL 4.1.x as database backend. The user authentication is handled using Apache's Basic Authentication. For users of Redhat or SuSE Linux it should work right out of the box. The Document Database is written modularly. All functions for modifying the authentication files are contained in one file. All functions for database access are contained in another file. It should therefore be easy to make the Document Database work on other web servers and with other databases.
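The modular split described above (all database access behind one file, so that another database can be swapped in) can be illustrated with a minimal sketch. It is written in Python rather than the project's PHP, and every name in it (`DocumentStore`, `SqliteStore`, `add_entry`, `find_by_key`, `register_paper`, the `entries` table) is hypothetical and not taken from the Document Database source. SQLite stands in for Postgres/MySQL only to keep the example self-contained.

```python
import sqlite3
from typing import Optional, Protocol


class DocumentStore(Protocol):
    """Hypothetical interface: the only surface the rest of the app sees."""

    def add_entry(self, bibkey: str, title: str) -> None: ...
    def find_by_key(self, bibkey: str) -> Optional[str]: ...


class SqliteStore:
    """One backend; a Postgres or MySQL class would expose the same methods."""

    def __init__(self, path: str = ":memory:") -> None:
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS entries "
            "(bibkey TEXT PRIMARY KEY, title TEXT)"
        )

    def add_entry(self, bibkey: str, title: str) -> None:
        # Parameterized query: values are never pasted into the SQL string.
        self.conn.execute(
            "INSERT INTO entries (bibkey, title) VALUES (?, ?)", (bibkey, title)
        )
        self.conn.commit()

    def find_by_key(self, bibkey: str) -> Optional[str]:
        row = self.conn.execute(
            "SELECT title FROM entries WHERE bibkey = ?", (bibkey,)
        ).fetchone()
        return row[0] if row else None


def register_paper(store: DocumentStore, bibkey: str, title: str) -> bool:
    """Application code depends on the interface, not on a specific database."""
    if store.find_by_key(bibkey) is not None:
        return False  # duplicate key, nothing inserted
    store.add_entry(bibkey, title)
    return True
```

With this layout, porting to another database would mean writing one more class with the same two methods; `register_paper` and the rest of the application would stay unchanged.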
You can always get the latest release and information from the Project Page.
Features
Here are the main features of the Document Database
* User Account Management with groups for Admins and Users
* Entry of any current BibTeX Document type with context sensitive help function
* Simple search within any single or all fields
* Boolean Search with Query Builder (1.2)
* Last viewed list
* Emulates BibTeX layout for easy verification including limited support for foreign characters (1.2)
* Uploads of documents (read the Disclaimer)
* Uploads of files into a searchable filearea (1.2)
* BibTeX output for each entry
* User can share notes on documents (1.2)
* Supports multiple instances on the same machine (1.0.1)
* Robust and easy installation (1.0.1)
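As an illustration of the "BibTeX output for each entry" feature listed above, here is a minimal sketch of rendering one stored record as BibTeX text. It is in Python, not the project's PHP; the function name `to_bibtex` and the dictionary layout are assumptions made for this example and do not reflect how the Document Database stores or formats its entries.

```python
def to_bibtex(entry_type: str, cite_key: str, fields: dict) -> str:
    """Render one entry as BibTeX text; empty fields are skipped."""
    lines = [f"@{entry_type}{{{cite_key},"]
    for name, value in fields.items():
        if value:
            # Braces around the value keep capitalization and spaces intact.
            lines.append(f"  {name} = {{{value}}},")
    lines.append("}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(
        to_bibtex(
            "article",
            "knuth84",
            {"author": "Donald E. Knuth", "title": "Literate Programming",
             "year": "1984", "note": ""},
        )
    )
```

The sketch leaves a trailing comma after the last field, which BibTeX accepts. It does not escape special characters such as `%` or unbalanced braces, which a real exporter would have to handle.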
-------------------------------------------------------------------------------------------------------
Document Archive (docarc) is a database application written in Perl to help you (and your workgroup) keep track of the vast amount of electronic documents and BibTeX entries you might have in use. The underlying database is MySQL (because it's addressed through Perl's DBI, it might work with others as well).
It's meant to be an alternative to .bib files and a complicated directory structure of document files. Although it's a web-based application, command line frontends to the most important features have been developed. Some work on integration into the great browser suites Mozilla and Firefox has also been done.
Live Demo (last updated 2005-02-02)
For those who want to look around, a live demo is available here. Username / password combinations are available for all three user groups (no documents may be added as guest):
admin:admin, user:user, guest:guest
Please do NOT edit the fields / doctype structure. Password change is disabled, so you also can't create users. Due to restrictions of my provider, ht://Dig powered fulltext search is not available.
Access to the public part of the live demo is possible here. Public installations of Document Archive users are listed here.
Download
You may download the current version (0.9.4) at SourceForge.net Project Page.
Known bugs and patches for v0.9.4 can be viewed here.
Since the Mozilla plugin was renamed, you have to delete the already installed plugin first (if its version is < 0.9.2) before updating it.
Documentation
1. FAQ [ps.gz]
2. Installation [ps.gz]
3. Configuration
   1. Templates [ps.gz]
4. Internal Structure
   1. Database [ps.gz]
This is a living document and I appreciate any questions (by email, in a forum, or through any of the trackers Bugs, Support Requests, Patches or Feature Requests from the project page) regarding the installation process, configuration, or any other part that should be explained in detail. If you want to know or comment on what is being implemented at the moment, have a look at the Task Manager.
Changes (0.9.5)
* arxiv layout has changed. a set of all current input filters can be downloaded here. replace the file parse_bibtex.pm and the directory parse_bibtex in your cgi-bin/modules directory.
* new input filters for Institute of Physics journals, Science Direct journals and PubMed
* new document type problemset for publishing exercise sheets to your students
* a new prx input filter has been written to compensate for changes in prl/pra/... layout. download prx.pl here and replace the old one in the modules/parse_bibtex directory
* almost everything has been completely rewritten. new features are
o multiple files can be attached to a bibtex entry
o document type of existing entries can be altered
o speed improvement
o pagewise browsing through category view of search result (ie. offset and limit are working)
o more comfortable editing of the doctypes/fields structure
* user space metadata can be attached to any document (comments, private bibtexids or categories)
* sped up search a little by optimizing sql queries
* introduced cli command upload to put a document file for a specified document into the database (eg. for use in makefiles to update your document entry)
* 1x1 document description fields are now represented by two radio buttons rather than a checkbox
* Content-Disposition header ensures correct filenames when downloading documents
* some pages now have HTTP Last-modified header so ht://Dig does not need to parse unchanged documents
* fixed database and style bugs and standards conformance
* after submitting a new entry, the user is redirected to the corresponding entry view (in case they want to add it to categories or file any metadata)
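The pagewise browsing mentioned in the list above ("offset and limit are working") can be sketched as follows. This is an illustration in Python with SQLite, not docarc's Perl/MySQL code; the table `documents`, its columns, and the functions `fetch_page` and `demo_connection` are all hypothetical names chosen for the example.

```python
import sqlite3


def fetch_page(conn: sqlite3.Connection, page: int, per_page: int = 10) -> list:
    """Return the titles on one result page; `page` is 1-based."""
    if page < 1 or per_page < 1:
        raise ValueError("page and per_page must be positive")
    offset = (page - 1) * per_page
    # A fixed ORDER BY is needed, otherwise pages may overlap or skip rows.
    rows = conn.execute(
        "SELECT title FROM documents ORDER BY id LIMIT ? OFFSET ?",
        (per_page, offset),
    ).fetchall()
    return [row[0] for row in rows]


def demo_connection(count: int = 25) -> sqlite3.Connection:
    """Build an in-memory table with `count` sample rows for the demo."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE documents (id INTEGER PRIMARY KEY, title TEXT)")
    conn.executemany(
        "INSERT INTO documents (title) VALUES (?)",
        [(f"paper {i}",) for i in range(1, count + 1)],
    )
    return conn
```

With 25 sample rows and the default page size, `fetch_page(conn, 3)` returns the last five titles and `fetch_page(conn, 4)` returns an empty list, which a web front end could use to decide whether to show a "next page" link.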
Features
* widely configurable
* charset recode (bibtex, html, rdf)
* tree structure for content classification
* multiuser containers may be used to group project oriented documents
* public runmode allows readonly access to allowed documents for anyone without authorization
* fully themeable
* traditional (standard) theme
o fully css-configurable per central stylesheet for nearly every class of elements
o user customizable (some options that influence the handling)
o strictly xml conformant (uses doctype xhtml 1.0 transitional)
o optimized for mozilla, but also runs under recent versions of other browsers. it's not planned to support obsolete browser versions because i'd like to concentrate on standards like xml, css and dom.
* simple layout that returns special pages for your internal search engine (eg. htdig)
* dynamic online help
* fields/doctypes structures and categories configurable via web interface
* command line interface (cli) for simple access during latex document compiling
* internal database search lets you specify complex search expressions
* fulltext search through document files available via integrated ht://dig support
* modularly extensible. available extensions:
o frame and iframe based extensions of the traditional theme
o planned: scan2pdf interface (tiff/bmp/png/jpeg -> pdf)
o planned: content extraction (just have to upload a pdf file, database extracts the necessary content)
o planned: printing and format conversion capabilities (ghostscript, xpdf, lpr, psutils)
o planned: email support (send registered users bibtex entries or document files)
o planned: documentation of template interface
o planned: little integration into some famous editors (eg. vi, emacs, kile)
o planned: more browser integration (eg. internet explorer, opera, konqueror)
* mozilla/firefox integration (plugin and search engine)
o direct access to documents via toolbar
o select html embedded bibtex and post it directly to docarc via context menu
o let docarc parse the currently viewed web-page (many famous e-journals are supported)
o query document archive from sidebar or url input field